際際滷

際際滷Share a Scribd company logo
Searchbots:
Lost Children ... or ... Hungry Psychopaths?
What Do Searchbots Actually Do? (and why it matters)




                       息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Googlebot ... may be unable to completely index
  all the content on your site



                      息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Doesnt tell us what searchbots index




                      息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Webserver Logfiles ....




           息 2007-2012 Roland Dunn
Webserver Logfiles ....




           息 2007-2012 Roland Dunn
Webserver Logfiles ....




           息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Over 2 months, 40% of all Googlebots requests
were to . just 2 URLs (on-site search URLs)




                     息 2007-2012 Roland Dunn
Over 2 months, 40% of all Googlebots requests
were to . just 2 URLs (on-site search URLs)

On a website with approximately 40,000 URLs in
Googles index (allegedly, according to site:)




                     息 2007-2012 Roland Dunn
Over 2 months, 40% of all Googlebots requests
were to . just 2 URLs (on-site search URLs)

On a website with approximately 40,000 URLs in
Googles index (allegedly, according to site:)

On a website serving approx. 150-200K unique
natural search visits/month


                     息 2007-2012 Roland Dunn
Over 2 months, 40% of all Googlebots requests
were to . 2 URLs.

On a website with approximately 40,000 URLs in
Googles index.

On a website serving approx. 150-200K unique
natural search visits/month


                     息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Categorise URLs by top level section  perhaps more
  useful than just URLs ...




                         息 2007-2012 Roland Dunn
Table of top level sections requested by Googlebot




                        息 2007-2012 Roland Dunn
Chart of top level sections requested by Gbot  6 months




                         息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Does Googlebot request all URLs served by a website?




                           息 2007-2012 Roland Dunn
Does Googlebot request all URLs served by a website?




                                                     No!



                           息 2007-2012 Roland Dunn
How Does Googlebot Spend Its Time? What Does it Request?




                           息 2007-2012 Roland Dunn
Does Googlebot spend its time cost-effectively?




                           息 2007-2012 Roland Dunn
Does Googlebot spend its time cost-effectively?




                                                     Not always.




                           息 2007-2012 Roland Dunn
Googlebot does not always request all content




                      息 2007-2012 Roland Dunn
Googlebot can become distracted, obsessed, or even lost:




                          息 2007-2012 Roland Dunn
Googlebot can become distracted, obsessed, or even lost:
 On-site search
 Additive filters/faceted URLs e.g.
  shoes?size=3&colour=green&price=10&brand=smith
 Sections with thin and/or very similar content


                          息 2007-2012 Roland Dunn
Googlebot may not spend its time efficiently
It needs help focusing on what we value



                      息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
SMX West, March 2011, Matt Cutts:
   (http://goo.gl/ZZz7E):
... if Google determines a site isnt as useful to
   users, they may not crawl it as frequently"

                        息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Conclusions:
 Searchbots do not always request all content
 Searchbots can become distracted
 Searchbots may use their time inefficiently
 Their visits are precious  we need to treasure them
 We may need to help them focus


                              息 2007-2012 Roland Dunn
Suggestions:
 Embrace logfiles  full of useful information




                               息 2007-2012 Roland Dunn
Suggestions:
 Embrace logfiles  full of useful information
 Check searchbot behaviour




                               息 2007-2012 Roland Dunn
Suggestions:
 Embrace logfiles  full of useful information
 Check searchbot behaviour
 If distracted, lost, inefficient, poor experience:
    Alter internal navigation & linking (e.g. flatten hierarchy)
    Robots.txt out (blunt approach)
    Alter URL construction (expensive!)


                                息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Lost Children ... or ... Hungry Psychopaths?




                        息 2007-2012 Roland Dunn
Or .... Distracted Teenager?




             息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Sonification: What do Search Engines Requests Sound Like?




                          息 2007-2012 Roland Dunn
Sonification: What do Search Engines Requests Sound Like?




                          息 2007-2012 Roland Dunn
息 2007-2012 Roland Dunn
Online Branding:                               Natural Search:
http://www.refinedpractice.com/                http://www.cloudshapes.co.uk/
T: @RefinedPractice                            T: @roland_dunn




              際際滷s Available At:
              http://www.cloudshapes.co.uk/talks/




                                  息 2007-2012 Roland Dunn

More Related Content

BrightonSEO: SearchBots: Lost Children or Hungry Psychopaths? What Do Searchbots Actually Do?