Nova Scotia Flag

SCOTIA SYSTEMS BLOG




Is The Google +1 System Getting Gamed?

July 26th, 2011 admin

image

On July 17th, Plussem.com launched their website offering to provide fake +1s for your website for a fee.

image

Like buying links, this service offers fake user endorsement of your site – potentially improving your ranking in the Google search engine.

It’s hard to know if Google will be able to detect this without knowing how Plussem actually generate the +1s? 

I’m sure Matt Cutts and the Google web spam team are working on ways to detect and ignore these fake endorsements, but if the spamming grows, it may affect how much +1s affect the ranking signals.





Google Adds Instant Preview to Webmaster Tools Labs

May 19th, 2011 admin

When Google added Instant Previews to user search results, there were questions as to how and when the images were created:

preview

 

Google have now added an additional feature to the popular Webmaster Tools (under Labs) to help website owners find issues which may be affecting the preview image generated by Google.

The tool gives you the option to specify any web page on your site and then generates a preview on the spot.   It compares the preview to the actual site side by side:

 

image

 

As you can see above, there are some small discrepancies (some borders/menu elements are missing), however this appears to be fairly random?  

Repeatedly hitting the compare button gives different results – possibly indicating a bandwidth issue when downloading pictures?   Maybe because so many webmasters are hitting the new tool at the moment!

Below the comparison, the tool provides a list of errors encountered when processing the site.    In the above example, the missing bitmaps are listed as “fetch failure”, which is odd as the images download fine for normal browsers.

This also raises the question – if the image on the left of the actual site is also generated in real time for the comparison – how come it succeeds to download all the pictures, whereas the instant preview doesn’t?

Google states that the image preview is generated using a WebKit engine which doesn’t support flash or Silverlight which may account for differences between the real site and the preview.

 

Check out the Google webmaster central blog for the official Google announcement.





The Funny Side Of robots.txt Files

December 17th, 2010 admin

 

History of Robots.txt

robot

Photo Credit http://www.flickr.com/photos/tamaleaver/

 

If you’ve never heard of robots.txt files before, they’re simply a  text file placed on a web server containing a set of rules to control how web spiders crawl a site.

Developed in 1994 by Martijn Koster, robots.txt files can now be found on most websites, however they’re rarely viewed by humans.

However did you know there’s a lighter side to these simple files?

For example, Malcolm Coles recently made a discovery when analysing the Daily Mail’s robots.txt file. Hidden inside was this secret message:

# August 12th, MailOnline are looking for a talented SEO Manager so if you found this you’re the kind of techie we need!

According to Malcolm as of August 25th, his post had received 5,500 views, so I suspect the position at the Daily Mail was quickly filled!

 

.

Google  Zombies

zombie

Photo Credit http://www.flickr.com/photos/kwl/

In October 2008, Google slipped a hidden message into the rules of their robots.txt file for Halloween:

User-agent : zombies

Disallow : /brains

The change was covered by Google’s Web Spam expert Matt Cutts in his personal blog  

http://www.mattcutts.com/blog/google-protects-itself-from-zombies/

 

 

Robot Reforms?

whitehouse

Photo Credit http://www.flickr.com/photos/dcjohn/

Now moving on to politics, did you know that when the Obama administration moved into the White House, they pruned the www.whitehouse.gov robots file down from nearly 2400 lines down to just:

User-agent: *

Disallow: /includes

Today the file now contains just the following:

User-agent: *

Crawl-delay: 10

Sitemap: http://www.whitehouse.gov/feed/media/video-audio

As a comparison, you can view an archive of the original Whitehouse Robots.txt

 

 

Asimov’s 3 Laws of Robotics

robots2

Photo Credit http://www.flickr.com/photos/gladius/

Quite a common tactic with robots files is to include Asimov’s 3 Laws of robotics.   Both Yelp.com and Last.FM have included Asimov’s 3 rules for Robots in their files:

Yelp.com

#

# 1. A robot may not injure a human being or, through inaction, allow a

# human being to come to harm.

#

# 2. A robot must obey orders given it by human beings except where such

# orders would conflict with the First Law.

#

# 3. A robot must protect its own existence as long as such protection

# does not conflict with the First or Second Law.

 

Last.fm

Disallow: /harming/humans

Disallow: /ignoring/human/orders

Disallow: /harm/to/self

 

Hidden Whisky

whisky

Photo Credit http://www.flickr.com/photos/bignavijp/

Another find by Malcolm Coles!   This time on the Whyte and Mackay website.    W&M decided to run an online promotion in October 2010, giving away 250 bottles of 30 year old whisky worth £150.

The special bottles were hidden in stores across the UK, however the first person to read the robots.txt and send an email also won a bottle (don’t bother it’s already been won!).

You can see the robots.txt with the competition at Malcolm’s blog:

http://www.malcolmcoles.co.uk/blog/whisky-hidden-in-robots-txt-file/

 

 

Bender and Gort

bender

Photo credit http://www.flickr.com/photos/josek/

Finally,we come to the robots.txt file for the site www.reddit.com.    Hidden in the text are instructions to two popular robots, Futurama’s own Bender (shown above), and Gort from the 1951 film, The Day the Earth Stood Still.

(photo credit http://www.flickr.com/photos/narcosislabs/)

gortreddit

Let us know in the comments if you find any other funny or odd robots.txt files!