Trouble No.016
Today’s topic is Google.
Google is one of the most important parts of the internet. People even use Google to check whether their internet connection is working! 😀 😀
So today we will see how the Google search engine actually works.
Google Search involves three essential steps:
1. Crawling and Indexing
2. Algorithms
3. Fighting Spam
We will look at these steps one by one.
1. Crawling and Indexing:
Finding information by crawling
Google uses software known as “web crawlers” to discover publicly available webpages. The most well-known crawler is called “Googlebot.” Crawlers look at webpages and follow links on those pages, much like you would if you were browsing content on the web. They go from link to link and bring data about those webpages back to Google’s servers.
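To make the "go from link to link" idea concrete, here is a minimal sketch of a crawler. It is not how Googlebot is implemented. It assumes a tiny made-up web stored in a dictionary (TOY_WEB), so no real network requests are made, and the names LinkExtractor and crawl are only for illustration.

```python
from collections import deque
from html.parser import HTMLParser


class LinkExtractor(HTMLParser):
    """Collects the href value of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


# Hypothetical toy web: URL -> HTML. "/orphan" has no inbound links.
TOY_WEB = {
    "/home": '<a href="/about">About</a> <a href="/blog">Blog</a>',
    "/about": '<a href="/home">Home</a>',
    "/blog": '<a href="/post-1">Post</a> <a href="/home">Home</a>',
    "/post-1": "No links here.",
    "/orphan": '<a href="/home">Home</a>',
}


def crawl(start, fetch):
    """Breadth-first crawl: fetch a page, extract its links, queue new ones."""
    seen = {start}
    queue = deque([start])
    fetched = {}
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:  # page does not exist
            continue
        fetched[url] = html
        parser = LinkExtractor()
        parser.feed(html)
        for link in parser.links:
            if link not in seen:  # never queue the same page twice
                seen.add(link)
                queue.append(link)
    return fetched


pages = crawl("/home", TOY_WEB.get)
print(sorted(pages))
```

Notice that "/orphan" is never fetched, because no crawled page links to it. A crawler that only follows links can discover a page only if something it has already visited points there.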
Organizing information by indexing
The web is like an ever-growing public library with billions of books and no central filing system. Google essentially gathers the pages during the crawl process and then creates an index, so it knows exactly how to look things up. Much like the index in the back of a book, the Google index includes information about words and their locations. When you search, at the most basic level, Google’s algorithms look up your search terms in the index to find the appropriate pages.
To explain how indexing is done, consider the search term ‘Katy Perry’ as an example:
“In a document, Katy Perry appears right next to each other. But what you want in an index is which documents does the word Katy appear in, and which documents does the word Perry appear in? So you might say Katy appears in documents 1, and 2, and 89, and 555, and 789. And Perry might appear in documents number 2, and 8, and 73, and 555, and 1,000. And so the whole process of doing the index is reversing, so that instead of having the documents in word order, you have the words, and they have it in document order.”
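The "reversing" described in the quote is usually called an inverted index. Below is a small sketch of the idea. The document numbers come from the quote, but the document texts are made up for illustration, and the function names build_index and search are hypothetical.

```python
from collections import defaultdict


def build_index(documents):
    """Map each word to the sorted list of document IDs that contain it."""
    index = defaultdict(list)
    for doc_id in sorted(documents):
        # set() so a word repeated in one document is listed only once
        for word in set(documents[doc_id].lower().split()):
            index[word].append(doc_id)
    return index


def search(index, query):
    """Return the IDs of documents that contain every word in the query."""
    postings = [set(index.get(word, [])) for word in query.lower().split()]
    if not postings:
        return []
    return sorted(set.intersection(*postings))


# Made-up documents, numbered as in the quote above.
documents = {
    1: "katy sings",
    2: "katy perry tour",
    8: "perry mason",
    73: "fred perry tennis",
    89: "katy writes songs",
    555: "katy perry album",
    789: "hello katy",
    1000: "perry county news",
}

index = build_index(documents)
print(index["katy"])                # [1, 2, 89, 555, 789]
print(index["perry"])               # [2, 8, 73, 555, 1000]
print(search(index, "Katy Perry"))  # [2, 555]
```

This sketch only checks that both words appear somewhere in a document. To require that "Katy" sits right next to "Perry", the index would also need to store word positions, which matches the earlier remark that the index holds information about words and their locations.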
This explanation follows a video by Google Webmasters. Watch here.
2. Algorithms:
Algorithms are computer programs that look for clues to give you back exactly what you want.
For a typical query, there are thousands, if not millions, of webpages with helpful information. Algorithms are the computer processes and formulas that take your questions and turn them into answers. Today Google’s algorithms rely on more than 200 unique signals or “clues” that make it possible to guess what you might really be looking for. These signals include things like the terms on websites, the freshness of content, your region and PageRank.
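One simple way to picture how several signals can be combined is a weighted sum. Google's real signals, weights, and formulas are not public, so everything below is an assumption made for illustration: the four signal names are loosely based on the examples above, and the weights, scores, and example sites are invented.

```python
# Hypothetical weights; the real ones are not public.
WEIGHTS = {
    "term_match": 0.5,    # how well the page's terms match the query
    "freshness": 0.2,     # how recent the content is
    "region_match": 0.1,  # whether the page fits the searcher's region
    "pagerank": 0.2,      # link-based importance score
}


def score(signals):
    """Combine signal values (each between 0 and 1) into one number."""
    return sum(WEIGHTS[name] * signals.get(name, 0.0) for name in WEIGHTS)


def rank(pages):
    """Return page names ordered from highest to lowest score."""
    return sorted(pages, key=lambda url: score(pages[url]), reverse=True)


# Invented candidate pages with invented signal values.
candidates = {
    "a.example": {"term_match": 0.9, "freshness": 0.2, "region_match": 1.0, "pagerank": 0.3},
    "b.example": {"term_match": 0.5, "freshness": 0.9, "region_match": 0.0, "pagerank": 0.8},
    "c.example": {"term_match": 0.3, "freshness": 0.5, "region_match": 1.0, "pagerank": 0.1},
}

for url in rank(candidates):
    print(url, round(score(candidates[url]), 2))
```

Here "a.example" ranks first mainly because of its strong term match, even though "b.example" is fresher and has a higher PageRank value. Changing the weights changes the order, which is why the choice of signals and weights matters so much.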
To learn how the search algorithms work, watch here.
3. Fighting Spam:
Every day, millions of useless spam pages are created. Google fights spam through a combination of computer algorithms and manual review.
Spam sites attempt to game their way to the top of search results through techniques like repeating keywords over and over, buying links that pass PageRank or putting invisible text on the screen. This is bad for search because relevant websites get buried, and it’s bad for legitimate website owners because their sites become harder to find. The good news is that Google’s algorithms can detect the vast majority of spam and demote it automatically. For the rest, Google has teams who manually review sites.
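As a toy illustration of catching "repeating keywords over and over", the sketch below measures what share of a page's words is taken up by its single most frequent word. This is not Google's actual method. The function names and the 0.3 threshold are assumptions chosen only for this example.

```python
import re
from collections import Counter


def top_keyword_share(text):
    """Return the most frequent word and its share of all words in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return "", 0.0
    word, count = Counter(words).most_common(1)[0]
    return word, count / len(words)


def looks_stuffed(text, threshold=0.3):
    """Flag text where one word makes up more than `threshold` of all words.

    The threshold is an arbitrary value picked for this sketch.
    """
    return top_keyword_share(text)[1] > threshold


normal = "The web is like an ever-growing public library with billions of books"
spam = "cheap shoes cheap shoes buy cheap shoes cheap cheap shoes"

print(looks_stuffed(normal))    # False
print(top_keyword_share(spam))  # ('cheap', 0.5)
print(looks_stuffed(spam))      # True
```

A check this simple is easy to fool and can misfire on very short texts, where a common word like "the" may dominate. That is one reason a combination of several algorithms and manual review is needed.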
Watch here as Matt Cutts explains how Google fights spam.
And that’s how Google search works!
Behind your simple page of results is a complex system, carefully crafted and
tested, to support more than one hundred billion searches each month.
by Google Inc.