StatCounter User Forum  
StatCounter Free web tracker and counter

Go Back   StatCounter User Forum > Webmaster > Lounge (non-StatCounter related topics here!)

Closed Thread
 
Thread Tools Display Modes
  #1  
Old 07-01-2006, 12:17 PM
Animation Animation is offline
Senior Member
 
Join Date: Jun 2005
Location: UK
Posts: 814
Default How do SEs work so fast?

As an experiment I did a search for the single letter 'a' on Google.
It gave 25,130,000,000 results in .28 seconds.

How is this high speed achieved in all it's indexed pages world wide?
Do all the data centres muck in and each do part searches for 'a', or does just one data centre achiever this speed?
Sorry to be so ignorant!
  #2  
Old 07-01-2006, 03:51 PM
China Tea China Tea is offline
Master Member
 
Join Date: Oct 2004
Location: California, USA
Posts: 1,822
Default Don't have the answer

But I would imagine, that everytime you search, results would come from one data center only. Therefore each data center would be conducting their own separate indexing(?) and most likely bringing it to you on its own without the help of other data centers(?)

The learned and the knowlegeable can correct the above.


BTW,
1,800,000,000 for china (0.08 seconds) - and everything was not about me?
  #3  
Old 07-01-2006, 04:05 PM
webado's Avatar
webado webado is offline
Moderator
 
Join Date: Apr 2004
Location: Montreal, Quebec, Canada
Posts: 28,173
Default

Those searches are done in a quick index. That quick index already knows how many ocurrences are for a word (approximately, as of the last full index update) because that number is stored alongside the word itself - without actually going about retrieving any information. Then Google only retrieves the first 1000 link maximum. Other search engines will retrieve a different maximum number of links.

(* Disclaimer: Ok, I'm speculating about the actual storage and retrieval mechanism of that index, but it's based on what I know historically of how indexing works in general. My knowledge may be obsolete. *)


And yes, the results are from the particular datacenter you landed on.
__________________
Christina
>>Forum Moderator<<

Please do not PM me for support. The forum is here for that.

Last edited by webado; 07-01-2006 at 04:16 PM.
  #4  
Old 07-01-2006, 05:05 PM
jonra01 jonra01 is offline
Master Member
 
Join Date: Feb 2005
Location: Mississippi
Posts: 3,607
Default

I think Christina's guess is pretty close to the way it works. There are probably some pretty hi-tech solutions in place also. Like server clustering. Still pretty amazing how this all works together.
  #5  
Old 07-01-2006, 11:21 PM
robinev robinev is offline
Senior Member
 
Join Date: Jul 2004
Location: Seattle WA USA
Posts: 906
Default

Google built their own databases (at least originally) instead of trying to customize any of the off-the-shelf dbs. It was (and, presumably, is still) optimized for the kinds of searches they do.

There are a lot of hints out there about how they manage to run multiple processes with just a limited amount of information to start with, but, once they've parsed the query string (split it into its components) it all has to be sent to the kinds of "inverted indexes" that Christina mentioned.

It sounds to me (and I'm also just guessing, based on what little I've read and actually understood), that they send each word in a query off to a separate process that brings back matches, which are then ordered by yet a different process using different rules.

My guess is that Microsoft uses some kinds of highly cusomized versions of SQL Server for their searches. It's notable that they spent years developing the version they called "TerraServer" which was (and is) used for mapping, but it was also research project that allowed them to make sure that the DB app could quickly handle terabytes of data from multiple thousands of simultaneous searches.

I've never heard about what Yahoo! uses, but they bought AltaVista before launching their own search engine. AltaVista started out as a research project by the late Digital Equipment company, using custom db solutions on their (then) super-fast chip (which isn't made anymore). I suppose it's possible that Yahoo! still uses some of what the Digital folks developed.
  #6  
Old 07-02-2006, 12:52 AM
rotarysteve rotarysteve is offline
Master Member
 
Join Date: Aug 2005
Location: Ohio
Posts: 1,713
Default google is the enemy, so to speak.... lol

Quote:
Originally Posted by Animation
As an experiment I did a search for the single letter 'a' on Google.
It gave 25,130,000,000 results in .28 seconds.

How is this high speed achieved in all it's indexed pages world wide?
Do all the data centres muck in and each do part searches for 'a', or does just one data centre achiever this speed?
Sorry to be so ignorant!
they have the resources, money and all of that there shtuff to put to shame the competition.

Trying to stay happy happy, but google, and admittedly yahoo "is" the evil enemies....


Guess not too happy happy here...........
  #7  
Old 07-02-2006, 07:23 AM
Animation Animation is offline
Senior Member
 
Join Date: Jun 2005
Location: UK
Posts: 814
Default Storage.

Thanks for all the replies.
I am now tempted to ask how it is all stored? Do they just have
mega numbers of hard drives? It would need hundreds of thousands of them surely, even with the biggest capacity available.

Has nybody seen around a data centre? Are the HDs all on racks in air conditioned rooms or installed in a
big PC ......1milex1milex10 miles size?
  #8  
Old 07-02-2006, 07:39 AM
webado's Avatar
webado webado is offline
Moderator
 
Join Date: Apr 2004
Location: Montreal, Quebec, Canada
Posts: 28,173
Default

This is not from Google, obviously, it's from The Planet - a big datacenter in Texas: http://www.theplanet.com/facilities/tour/index.html

But you get the idea of what facilities might look like.

I think I saw a Google presentation once.
__________________
Christina
>>Forum Moderator<<

Please do not PM me for support. The forum is here for that.
  #9  
Old 07-02-2006, 08:02 AM
faze3 faze3 is offline
Senior Member
 
Join Date: Feb 2005
Location: N. Wales, UK
Posts: 932
Default

Be no good for you and your ciggies in there, Christina (or mine come to that!) Quite an amazing tour.
Alan.
  #10  
Old 07-02-2006, 09:04 AM
JWJ JWJ is offline
Moderator
 
Join Date: Jan 2004
Location: England
Posts: 5,772
Default

Impressive.
__________________
... John ...
Closed Thread

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 04:05 AM.


Powered by vBulletin® Version 3.8.5
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.