I'm Feeling Lucky_ The Confessions of Google Employee Number 59 - Douglas Edwards [20]
"The second half is how we determine which results are most relevant to the specific query we've received. Most of our competitors look at basic stuff like how many times a word appears on a page. We look at what we know about how sites use that term on their pages. What words appear next to it? Is it in bold type or a different font? How does the term appear in links pointing to those pages? That link analysis is really important. The words in the link pointing to a page are called anchor text."
A chain grew from a corner of my clown's mouth and fell to the bottom of the page, where an anchor suddenly appeared surrounded by grinning fish with barracuda teeth.
"How well we match the query determines our search quality," Craig went on, "which is not an exact science, since evaluating whether a query is a good match is somewhat subjective. If you searched for 'jaguar,' did you mean the car or the cat or the football team? Sometimes it's difficult to disambiguate a query like that."
I wrote down "disambiguate," and said it silently to myself three times so it would become my word. And I drew something that looked vaguely like a spotted jungle cat chasing the fish around the anchor, then added bubbles since he was underwater.
"Once we determine the order of the pages we want to show, we need to serve the results back to the user who submitted the query. That's where gwiss comes in." As he said "gwiss," Craig wrote "GWS" on the whiteboard. Beneath it he wrote "Google Web Server." "Gwiss is the software that actually interacts with users when they submit a query and when we serve results back to them. When we want to update how Google looks to users, we need to push out a new gwiss to implement those changes."
I couldn't think of what a gwiss might look like, so I sketched some Swiss cheese behind the clown head. By the time Craig had finished, I had a broader understanding of the way Google worked and a bizarre doodle to add to my collection of things not to share with my new coworkers.
Later Urs confirmed that Google had kicked butt in search quality even before Larry and Sergey left Stanford in 1998, because link analysis was an alchemist's stone for turning web dross into gold. Google's relevancy lured in early adopters and the media, but behind its beguiling look lay an arthritic infrastructure in danger of collapsing. "The ranking beat AltaVista by a mile," Urs told me, "but it was slow and we couldn't build an index reliably."
The challenge of improving Google's crawling, indexing, and serving systems was what had drawn Urs to the company. He'd figured the project would take about a year and then he'd move back to Europe. "I underestimated how much of a systems problem this whole thing was," he confessed. "We had a university system and we needed to basically rewrite the whole thing from scratch." While Google did a good job with the data it had, it collected far too little and wasn't searching through it fast enough.
Speed or scale. Pick one. When we crawled more web pages, the index got bigger and the pageranker had more data to draw upon, so we could produce more-relevant results. That attracted more users and more searches, so our audience grew. A bigger index, however, required more machines doing more processing, and more processing took more time. Adding users puts more demand on the network, which, as anyone sharing an Internet connection knows, slows things down.
As they did when forced to choose a future for Google as a destination site or a technology provider, Larry and Sergey chose both. Google's quest would be to get faster even as it expanded in all directions. They went looking for others who shared their disdain for the limits imposed by nature's laws.
Urs to Reason Why, Then to Do or Die
"Urs was mostly setting the goals at that time," early engineer Jeff Dean explained to me, "because Larry and Sergey were doing more stuff on the business side. They were more involved in deals and that kind of thing." Larry and Sergey could write code, but it wasn't what they were best at.
"I didn't trust