Adam Fendelman

Adam Fendelman

Posted October 10, 2008 | 07:17 PM (EST)

Why My Site Lost All Search Traffic From Google

digg Share this on Facebook Huffpost - stumble reddit del.ico.us RSS

Since a few weeks ago, I've lost all Google search traffic to HollywoodChicago.com, which is my Chicago-based film publication (others are on staff, too). As you could imagine, this wouldn't exactly be a welcome cherry on anyone's sundae.

It was especially unnerving considering I only recently finished battling GoDaddy. They sent me a groundless hosting bill for $6,579.91 that was reduced to $969 and then waived entirely (in large part due to writing the story here).

So, why would a Web publication suddenly lose all its search traffic from Google and Google News? This Web publication, by the way, has been publishing original reviews, interviews and news on film, theater and TV since early 2000. There are a number of reasons why it holds Internet legitimacy.

It is published by a Chicago Film Critics Association-accredited film critic (yours truly), is approved on the Tomatometer at Rotten Tomatoes and has been cited in dozens of publications around the world including the Los Angeles Times, Chicago Sun-Times, USA Today, FOX Chicago, CBS 2 Chicago and AOL's Cinematical.

I don't mention these facts for bragging points. I mention them because (while it might seem like it would) they don't matter when it comes to this Google issue.

Google didn't stop indexing HollywoodChicago.com because the search behemoth suddenly thought the publication was illegitimate. Google also didn't destroy the site's PageRank for this same reason. PageRank, by the way, is the way Google measures how important your site is and how high your site appears for various search queries.

After spending nearly two years to earn a Google PageRank of 5/10, in the last couple weeks my PageRank dropped to an impossible 0/10. At 0/10, that means Google thinks my site is as unimportant as someone who'd throw up a blog today with absolutely nothing on it.

On my fix mission, my first step was finding out what went wrong. This proved to be challenging because (as you may know if you've ever tried to get in touch with a human at Google) it's rather difficult to get in touch with a human at Google.

I started by e-mailing my issue to press@google.com since I am a journalist and this was a legitimate problem. I got a quick response. Someone at Google forwarded the inquiry to someone else at Google. She wrote to me: "My colleague forwarded me your inquiry about PageRank and your site. Let me look into it and I'll get back to you."

A week went by and she went MIA. I sent several follow-up e-mails to her with no dice. The problem persisted. I started seeking other methods for correction.

Though many people likely don't know it exists, Google Webmaster Tools is a very helpful application. It shows you what your site looks like from the perspective of Google's brain. Google indexes your Web site using an automated Internet spider called Googlebot.

Google Webmaster Tools gives you a wealth of information about what Googlebot sees in your site along with what's wrong with it.

I didn't think to check there until I found a Google Group (a discussion forum) specifically to help Webmasters. Very knowledgeable people (including people who work for Google) respond there very quickly. It's an excellent way to leap that hurdle of actually getting to speak with Google humans.

John Mueller (who publicly lists himself as a Webmaster trends analyst at Google Zurich) was the first one to reply. He hit the nail on the head right out of the gate. John recommended that I check for errors with my site in Google Webmaster Tools.

Once I was alerted to do this, the graph below from Google Webmaster Tools tells the whole story. Notice the section I've highlighted in red. This crawl graph shows you when Google stopped indexing my site entirely.

2008-10-08-googlecrawlstats.jpg

Sure enough, there were thousands of new errors. There were 11,260 HTTP problems due to a "4xx error" along with 2,684 URLs restricted by robots.txt and 2,480 errors for URLs in my sitemap. In layman's terms, this means Google was being blocked from indexing my site.

Of course, I didn't mean to do that. So why did I?

John asked if I was banning any IP addresses from accessing my site. Sure enough, I was. Like most sites out there, spammers also have an interest in mine. While I have various technologies in place to block them from posting content that violates our policies, sometimes they sneak through. For them, I've banned their IP addresses directly.

In banning IP addresses directly in response to spammed content, I also accidentally banned Googlebot.

Whoops! How would I have known that? I didn't know one of the many IP addresses for Googlebot. I didn't know to do a reverse IP lookup to determine that banning 66.249.73.% was indeed preventing Googlebot from indexing my whole site.

(Tip: You can do a reverse IP lookup at a site like this to make sure you're not inadvertently banning an IP address that you shouldn't be).

(Another tip: If you encounter an issue similar to this, use this highly practical and responsive Google Group to help you diagnose your issue. This is what solved it for me.)

While Google Webmaster Tools clearly showed me the problem, I still wouldn't have known why until John tipped me off. Though I'm still waiting for Google to index my site again and for my PageRank to return, I can rest calmly now that I've definitively found the problem and I could potentially help to prevent you from making the same mistake.

While the answer here was indeed in existence, in the end it ultimately lacked visibility and a clear explanation of what to do about it.

Just as GoDaddy could have sent an automated e-mail notifying me that I was about to go way over my disk quota and be charged thousands of dollars in penalties, Google could have also sent me an automated e-mail to let me know that thousands of URLs had been restricted from their index.

That tool could be set with a user-defined threshold so you as a Webmaster could determine when you'd receive that message. You'd essentially pick how many errors would have to be registered in order for you to care about it.

For me, that simple e-mail would have saved me weeks of lost traffic, hassle and headaches. For you, that simple e-mail could prevent such an issue from ever happening in the first place. For Google, that simple and proactive e-mail could mean what's arguably the best search engine in the world could be just a bit better.

Since a few weeks ago, I've lost all Google search traffic to HollywoodChicago.com, which is my Chicago-based film publication (others are on staff, too). As you could imagine, this wouldn't exactly b...
Since a few weeks ago, I've lost all Google search traffic to HollywoodChicago.com, which is my Chicago-based film publication (others are on staff, too). As you could imagine, this wouldn't exactly b...
 
Comments
4
Pending Comments
0
iPhone App Promo

Want to reply to a comment? Hint: Click "Reply" at the bottom of the comment; after being approved your comment will appear directly underneath the comment you replied to

View Comments:

PageRank isn't a measure of how high your site will rank. It's a measure of how important your pages are seen as being, but pages with less PageRank can out rank you if they are more relevant for a particular topic. For example, Google itself has a PageRank 10 for its home page. But you don't see Google always ranked tops for everything you search on, because other factors go into the mix in ranking, as well.

Seeing that drop is indeed, as you noticed, a sign of potential problems. Fortunately, as you discovered when you started doing what "normal" people would have done, there are a range of tools Google has built up over the past three years that help you discover issues like you encountered and get them fixed.

Regarding the email you wanted, well, that's easier than it sounds. It's difficult to know who "owns" all the sites out there, especially when you consider some sites are within other domains. Google actually has tried to guess at people's emails to let them know if they've been blocked for spamming or malware. But the other issue is that you have people who purposely block Google using things like a robots.txt file. So does Google send potentially annoying emails saying "Hey, did you know you're blocking us?" And sometimes, only part of a site is blocked.

    Favorite    Flag as abusive Posted 04:06 PM on 10/13/2008

From one Adam to another...

I'm really sorry to hear about your troubles, but -- as one of the Googlers affiliated with Google's Webmaster Central -- I'm delighted to see your kind words about our Help Group (and also about my colleague John Mueller, whom we all love here, too!)

Your suggestion about notifications in Webmaster Tools is a great one. We've been showing more and more messages in the Message Center for exactly the same purpose (to alert webmasters to potential or actual problems with their site), and I'll share your specific recommendation (notifications re: major crawl errors) with the Tools team.

Thank you for sharing your experience so thoughtfully! I hope and trust that Googlebot will return soon to your site.

    Favorite    Flag as abusive Posted 03:38 PM on 10/11/2008
- Adam Fendelman - Huffpost Blogger I'm a Fan of Adam Fendelman permalink

Adam,

Glad to hear that. Cheers!

~ Adam

    Favorite    Flag as abusive Posted 12:48 AM on 10/13/2008

Besides the issue stated in this article there's also work presently being done by governments, powerful corporations and industries to shut down freedom of speech on the internet. So-called "conspiracy" sites are being labeled as "unreliable" and therefore will get a drop in page rank or censored. For instance, if you did a search on vaccines a few months ago you'd find most sites were critical of vaccines. Do the search today and most sites will be pro-vaccine and it's almost impossible to now find those anti-vaccine sites. It's no accident and as you might assume by seeing this type of censorship in China, that this is not good for the people.

http://www.naturalnews.com/024240.html

    Favorite    Flag as abusive Posted 02:18 AM on 10/11/2008
Comments are closed for this entry

You must be logged in to reply to this comment. Log in  or  Connect