My favorites | English | Sign in

Let's make the web faster

Web metrics: Size and number of resources

Author: Sreeram Ramachandran, Software Engineer
Last updated: 26 May 2010

Here are some statistics about the size, number of resources and other such metrics of pages on the world wide web. These are collected from a sample of several billions of pages that are processed as part of Google's crawl and indexing pipeline. In processing these pages, we not only take into account the main HTML of the page, but also discover and process all embedded resources such as images, scripts and stylesheets.

Highlights:

  • The average web page takes up 320 KB on the wire.
  • Only two-thirds of the compressible material on a page is actually compressed.
  • In 80% of pages, 10 or more resources are loaded from a single host.
  • The most popular sites could eliminate more than 8 HTTP requests per page if they combined all scripts on the same host into one and all stylesheets on the same host into one.

Caveats:

  • All resources are fetched using Googlebot, which means they are subject to robots.txt restrictions. For example, some sites (such as the BBC) block CSS and JS.
  • Some sites may present a different view of the resources to Googlebot than to regular users. For example, until recently, Google's own servers used to serve CSS and JS uncompressed to Googlebot, while compressing them for regular user browsers.
  • Pages are rendered and subresources are discovered through the eye of WebKit. If a page serves resources differently for Internet Explorer or Firefox, those won't be visible here.
  • Sampling of pages for processing is not uniformly random or unbiased. For example, pages with higher PageRank are more likely to be included in these metrics.

Older stats