Hi,
I need a little help with a problem: my site isn't being indexed by Google, and it's something we really need. My site, www.jollygreenthumb.com, has been up since mid-June. Google has attempted to crawl it 9-10 times and has yet to index the first page.
I have a sitemap built and Google reports no errors in it, yet every time it tries to crawl the site it records the error "Network unreachable", and in the details it gives "Unreachable URL".
The site is hosted by firmsupport, and changes have been made in an attempt to correct the problem, but to no avail.
Of course, I have not received any answers to my emails to Google about this, and I hope someone here can shed a little light on it.
This error has happened on both .html pages and .a5w pages.
I have included an Excel file that I downloaded from Google's sitemap site that lists the errors and pages.
To try to track down the problem, I built a simple HTML file and served it from my little server: https://www.npwas.com/text.html
I then used the keyword tool on Google's AdWords site to make the Googlebot attempt to crawl the page.
I think you must have an AdWords account to use it, but the URL is https://adwords.google.com/select/KeywordTool
If you select the 2nd tab, "Site-Related Keywords", Google will send the Googlebot to the page and attempt to scan it for keywords.
Each time I use this tool, for both jgt and npwas, it returns an error:
Unable to access https://www.npwas.com/text.html.
We're sorry, but we can't seem to access the URL https://www.npwas.com/text.html. Please make sure you have entered the URL correctly and that the site is currently available. Or, begin the keyword generation process again by entering a different URL.
I am assuming that the keyword tool is a good proxy for the regular Googlebot and that the 2 errors are the same.
I have included my server logs from the www.npwas.com test.
It seems that the WAS is attempting to serve the page, but Google still reports an error. 3 issues came to mind:
- Does Google recognize the .a5w extension as a valid script extension? I had to list it before the sitemap generator recognized it. I sent an email to Google with this question and they blew me off and sent me to forum hell. I used Google search to check for sites and pages using .a5w as the page extension and found a few that have been indexed, but I didn't find any instance where I could say for sure that the .a5w page was crawled and new links were found. Just being listed in the index doesn't, I think, mean that the page was crawled; it could be the destination of a link that wasn't followed. I can't tell.
- Is this an issue of having a session ID in the URL? I understand, from reading some forums, that this may be an issue with Google. I can't see what the WAS returned to the Googlebot, but if it's the same as what's in the logs, this might be an issue. According to Lenny in this post
  http://msgboard.alphasoftware.com/al...76503#poststop
  "The server will always add the session ID to the URL for the first request because it does not yet know if the client has a cookie."
- I am currently wondering if the request header "Accept-Encoding: gzip" might have anything to do with it.
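To check the gzip and session ID ideas from an ordinary machine, something like the following Python sketch could be used. It builds the same request with and without "Accept-Encoding: gzip", using Googlebot's published user-agent string, and reports the status, the final URL, and the response encoding. The function names are made up for illustration. A request sent this way still does not come from Google's network, so it cannot reproduce a "Network unreachable" that is caused by routing or a firewall; it only shows how the server answers crawler-style headers.

```python
import gzip
import urllib.request

# Googlebot's published user-agent string. Only the header is imitated here;
# the request still originates from the machine running this script.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


def build_probe(url, use_gzip):
    """Build a GET request carrying crawler-style headers."""
    headers = {"User-Agent": GOOGLEBOT_UA}
    if use_gzip:
        headers["Accept-Encoding"] = "gzip"
    return urllib.request.Request(url, headers=headers)


def decode_body(raw, content_encoding):
    """Return the body bytes, gunzipping them if the server marked them as gzip."""
    if (content_encoding or "").lower() == "gzip":
        return gzip.decompress(raw)
    return raw


def probe(url, use_gzip, timeout=10):
    """Fetch url and summarize how the server responded."""
    with urllib.request.urlopen(build_probe(url, use_gzip), timeout=timeout) as resp:
        encoding = resp.headers.get("Content-Encoding")
        body = decode_body(resp.read(), encoding)
        return {
            "status": resp.status,
            # Differs from url if the server redirected, e.g. to add a session ID.
            "final_url": resp.geturl(),
            "content_encoding": encoding,
            "body_bytes": len(body),
        }
```

Calling probe("https://www.npwas.com/text.html", False) and then probe("https://www.npwas.com/text.html", True) and comparing the two results would show whether the gzip header changes the response, and whether the final URL comes back with a session ID in it.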
Is it possible to change the settings in the WAS so it does not send the session ID in the URL?
Is it possible to change this setting just for bots and spiders?
Does anyone know how to find out whether Google can crawl a .a5w page?
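One way to approach the last question is to look in the server logs for Googlebot requests to .a5w pages and see which status codes were returned. Below is a rough Python sketch of that idea. It assumes the log is in the Apache-style "combined" format, which may not be what the WAS actually writes, so the pattern would need adjusting to the real layout. The function name is made up for illustration.

```python
import re

# Assumption: Apache-style "combined" log lines. The WAS may use another
# layout, in which case this pattern has to be changed to match it.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def googlebot_hits(lines, extension=".a5w"):
    """Yield (path, status) for Googlebot requests whose page ends in extension."""
    for line in lines:
        match = LOG_LINE.match(line)
        if not match or "googlebot" not in match["agent"].lower():
            continue
        # Ignore any query string when checking the page extension.
        page = match["path"].split("?", 1)[0]
        if page.lower().endswith(extension):
            yield match["path"], int(match["status"])
```

Passing the lines of a log file to googlebot_hits and printing the results would list every .a5w request that identified itself as Googlebot. A 200 status there means the server delivered the page to the crawler, although it still does not prove the page was indexed.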
Thanks in advance