![]() |
|
|
|
|
|
|||
![]() |
|
|
|
Reader Feedback - Website Highjacking Whenever I'm at trade shows, run seminars, or speak at symposiums I am asked the question "what is the Google dance?" I've heard a few different theories regarding "the Google Dance", but only one is really correct. It's the period when Google is rebuilding its rankings, and results fluctuate widely for a 3 to 5 day period.
Because Google, like every other search engine, depends on their customers knowing that they deliver authoritative reliable results 24 hours of the day, seven days a week, updates pose a serious issue. They can't 'shut down for maintenance' and they cannot afford to go offline for even one minute. Hence, we have the Dance. Every search engine goes through it, some more or less often than Google. However, it is only because of Google's reach that we pay attention to its rebuild more than any other engines' During this period, the index is constantly in flux, and search results can vary wildly, because it is also during the Dance that Google makes any algorithm adjustments live, and updates the PageRank and Back Links for each web site it has indexed.
Do Search Results Only Change
During The Google Dance? Because Google is constantly crawling and updating selected pages, their search results will vary slightly over the course of the month. However, it is only during the Google Dance that these results can swing wildly. You also need to consider that Google has 8 data centers, sharing more than 10,000 servers. Somehow, the updates to the index that occur during the month, and outside of the Google Dance have to get transferred throughout. It's a constant process for Google and every other search engine. These ongoing, incremental updates only affect parts of the index at any one time.
Checking the Google Dance
· www-ex.google.com - (where you get when you type www.google.com) During the Google Dance, you can check the 8 Google servers, and they will display sometime wildly differing results, thus they are said to be "dancing", and hence the name "Google Dance". The easiest way to check if the Google Dance is happening is to go to www.google.com, and do a search. Look at the blue bar at the top of the page. It will have the words "Results 1 - 10 of about 626,000. Search took 0.48 seconds". Now check the same search on www2.google.com, and www3.google.com. If you are seeing a different number of total pages for the same search, then the Google Dance is on. You can also check all the variations above. www2 is really www-sj, and www3 is www-va. We have found that all the others need their full www-extension.google.com in the url if you want to test them properly. Once the numbers, and the order of results on all 8 www's are the same, you know the dance is over.
Importance of The Google
Dance
The Technical Background of the
Google Dance Google uses Linux servers. When the rebuild happens, all 10,000 of these servers are updated. Naturally, there will always be some variation from one index to the next - just because there always are new sites being added, and content changes being made that affect the placement of some websites. But during the Google Dance, these variations are dramatic. One server after the other is updated with portions of the new index, until eventually, they are all updated with a completely new index database.
Google Dance and DNS Google uses multiple data centers to get results to the end user faster. If you access a data center that is physically close to you, then in theory, your connections need to make less hops - or navigate less intersections - to get to the data center and back. Each data center has its own IP address (numerical address on the internet) and the Domain Name System (DNS) system manages the way that these IP addresses are accessed. The system instantly routes your request to the nearest, or least congested data center. It's then routed within that data center facility to an idle server. In this way, Google is using a two step form of load balancing by its use of the DNS tables and then internalized traffic management. Therefore, the distance for data transmissions can be reduced and the speed of response improved. During the Google Dance period, all the servers in all the data centers cannot receive the new index at the same time. In fact, only portions of the new index can be transferred to each data center at one time, and each portion is transferred to one after the other. Different portions are uploaded to each server farm at different times, which also affects results. When a user queries Google during the Google Dance, they may get the results from a data center which still has all or part of the old index in place one minute and then data from a data center which has new data a few minutes later. From the users perspective, the change took place within seconds. Building up a completely new index every month or so can cause quite a bit of trouble. After all, the search engines have to spider and index billions of documents and then process the resulting data compiled into one cohesive unit. That's no small feat. During the period outside of the Dance, there may also be minor fluctuations in search results. This is because the index at the various data centers can never be identical to each other. New sites are constantly being added, old ones deleted, etc... It is estimated that over 8 million new web pages are created every day. Some of them are added to the search engines, and thus affect search results.
Now, if you want Google's definition of the Google Dance visit their page about the
Google Dance. Looks like fun, I'd go!
About The Author
![]() Printer Friendly Version of this Article
|
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
|
|
| SiteProNews - The Net's most widely read Webmaster newsletter | |
|
|