A patent for “systems and methods for improving the ranking of news articles” was granted to Google yesterday. It was originally filed in 2003.
The patent appears to provide a look at some of the factors Google may take into consideration when ranking articles for Google News. As Bill Slawski of SEO By the SEA points out, it is most likely missing a significant number of the elements that Google takes into consideration today, and some of the ones mentioned may have changed. It is, after all, six years later. The web is a completely different place, and so is the news industry.
The abstract for the patent is as follows:
A system ranks results. The system may receive a list of links. The system may identify a source with which each of the links is associated and rank the list of links based at least in part on a quality of the identified sources.
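To make the abstract a little more concrete, here is a minimal Python sketch of the three steps it describes: receive a list of links, identify the source of each, and rank by source quality. The quality scores, the `SOURCE_QUALITY` table, and the function names are all hypothetical and do not come from the patent. The abstract also says the ranking is based only "at least in part" on source quality, so a real system would presumably combine this with other signals such as relevance.

```python
from urllib.parse import urlparse

# Hypothetical per-source quality scores; the patent gives no actual values.
SOURCE_QUALITY = {
    "cnn.com": 0.9,
    "bbc.co.uk": 0.9,
    "hometownnews.example": 0.4,
}


def identify_source(link):
    """Map a link to its source, here simply the host name without 'www.'."""
    host = urlparse(link).netloc.lower()
    return host[4:] if host.startswith("www.") else host


def rank_links(links, quality=SOURCE_QUALITY, default=0.0):
    """Order links so that those from higher-quality sources come first.

    Unknown sources get the default score. The sort is stable, so links
    whose sources have equal quality keep their original relative order.
    """
    return sorted(
        links,
        key=lambda link: quality.get(identify_source(link), default),
        reverse=True,
    )


links = [
    "https://hometownnews.example/council-vote",
    "https://www.bbc.co.uk/news/story",
    "https://unknown.example/post",
]
ranked = rank_links(links)
```

In this sketch the BBC link moves to the front, the hometown source follows, and the unknown source lands last.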
The patent takes a few liberties that would undoubtedly be heavily criticized in the blogosphere. “For example, CNN and BBC are widely regarded as high quality sources of accuracy of reporting, professionalism in writing, etc., while local news sources, such as hometown news sources, may be of lower quality,” it says. Based on the queries we have run, however, it is easy to tell that Google does give some weight to local sources.
In one “implementation consistent with the principles of the invention,” here are some factors that are mentioned:
– a number of articles produced by the news source during a first time period
– an average length of an article produced by the news source
– an amount of important coverage that the news source produces in a second time period
– a breaking news score
– an amount of network traffic to the news source
– a human opinion of the news source
– circulation statistics of the news source
– a size of a staff associated with the news source
– a number of bureaus associated with the news source
– a number of original named entities in a group of articles associated with the news source
– a breadth of coverage by the news source
– a number of different countries from which network traffic to the news source originates
– the writing style used by the news source
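One simple way to picture how factors like these could feed into a single source score is a weighted sum. The sketch below is only an illustration under stated assumptions: the choice of four factors, the field names, the equal weights, and the idea that each factor has already been normalized to a 0-to-1 scale are all hypothetical, not something this article or Google has confirmed.

```python
from dataclasses import dataclass


@dataclass
class SourceStats:
    """A few of the listed factors, each assumed to be normalized to 0..1."""
    articles_in_period: float   # number of articles in a first time period
    avg_article_length: float   # average length of an article
    breaking_news_score: float  # breaking news score
    human_opinion: float        # human opinion of the news source


# Hypothetical weights; the real weighting, if any, is not public.
WEIGHTS = {
    "articles_in_period": 0.25,
    "avg_article_length": 0.25,
    "breaking_news_score": 0.25,
    "human_opinion": 0.25,
}


def source_quality(stats, weights=WEIGHTS):
    """Combine the normalized factors into one score as a weighted sum."""
    return sum(weight * getattr(stats, name) for name, weight in weights.items())


national = SourceStats(1.0, 1.0, 1.0, 1.0)
hometown = SourceStats(0.5, 0.5, 0.0, 1.0)

national_score = source_quality(national)
hometown_score = source_quality(hometown)
```

With these made-up numbers the national source scores higher than the hometown one, but changing the weights (for example, favoring human opinion) would shift the result, which is part of why a list of factors alone says little about actual rankings.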
We can’t treat these factors as the absolute law for how Google determines its news article rankings. Unless Google comes out and says “this is it,” we need to take the list for what it is – the partial contents of a six-year-old patent filing.
That said, the factors are worth considering when trying to determine how Google News ranks content. There is much more information to digest in the patent, so if you’re feeling up to it, you can take a look at it here. Slawski also has a pretty good (and easier to read) analysis of it.