Retchel's Online Buzz

Sunday, January 13, 2008

Robot.txt file blocked googlebots crawling

I spent so many hours checking my webmaster tools on my google account. I found out that only few of my posts here and here that googlebot were able to indexed or crawled because of the robot.txt that is in my file . But I have no idea where that robot file came from. Is it from the hotmail codes? from the templates? or where?
There were 55 lists of my post url that were restricted by robot.txt and that googlebot had trouble crawling it. Maybe that's why, when I checked my google indexed it only shows up a few.
But I don't really understand what this mean, cause other websites want robot.txt file on their site cause they don't want any search engines to index the content of their page. But it does'nt make sense for me, as bloggers, we want our url post to be indexed by google.

If you want to check the status of your site, you can check on google websmaster tool here. Just log in to your google account and add you url site and once you verify your site, you can now view the status of it since the last time google crawled your site. You can also view your url's that google had hard time crawling and why they could not crawl them.

Anyway, so much for that, hopefully googlebot will be able to index my coming post for their next schedule crawl.

1 buzz me:

Anonymous said...: Are you talking about your Blogspot blog? Then you can't do anything about it because that's Google's default. Blogspot doesn't allow indexing of RSS and scrawling of label pages.

Your post page also have difficult getting up to Google index because they treat it as duplicate content. It might take a long time before any page which is featured on the front page to come to the index. Especially for pages with too many links.

Well that's my own experience and knowledge. It's funny that Google the search engine own blog host is causing duplicate content issues.; January 20, 2008 at 1:19 PM

I Disclosed

This policy is valid from 30 April 2008 This blog is a personal blog written and edited by me. This blog accepts forms of cash advertising, sponsorship, paid insertions or other forms of compensation. This blog abides by word of mouth marketing standards. We believe in honesty of relationship, opinion and identity. The compensation received may influence the advertising content, topics or posts made in this blog. That content, advertising space or post will be clearly identified as paid or sponsored content. The owner(s) of this blog is compensated to provide opinion on products, services, websites and various other topics. Even though the owner(s) of this blog receives compensation for our posts or advertisements, we always give our honest opinions, findings, beliefs, or experiences on those topics or products. The views and opinions expressed on this blog are purely the bloggers' own. Any product claim, statistic, quote or other representation about a product or service should be verified with the manufacturer, provider or party in question. The owner(s) of this blog would like to disclose the following existing relationships. These are companies, organizations or individuals that may have a significant impact on the content of this blog. To get your own policy, go to http://www.disclosurepolicy.org

About Me

List

Chalyza"s Blog

Jungle Beauty"s Corner(Che2x)

Blog Archive

My Blogroll

Sunday, January 13, 2008

Robot.txt file blocked googlebots crawling

1 buzz me:

Subscribe

Links

Feeds

My Visitors

I Disclosed

My Communities