Google crawling elasticsearch.org?


(searchersteve) #1

This is maybe a question to Kimchy, but I'll share with the community:

Since your site redesign, I've noticed that every time I do a simple Google search for your documentation, I end up with a link that no longer works. Google doesn't appear to have a current sitemap or something. I assume you have a decent amount of traffic, so I wonder why Google won't update the information.

I've finally accustomed myself to going to the site and then searching, which is of course what I should be doing anyway. I'm just lazy and addicted to my Google browser plugin.

-- Steve


(Shay Banon) #2

To be honest, I am at a loss here. I have done everything I can to "tell" Google to stop indexing (or remove) elasticsearch.com. It redirects using 301, bad links return 404, Webmaster Tools has been updated to reflect that, and it shows that Google does index elasticsearch.org.
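For anyone who wants to verify the 301 behaviour described above from their own machine, here is a minimal Python sketch. The function names `fetch_status` and `redirects_permanently_to` are made up for this illustration, and it assumes the old domain answers plain HTTP on port 80. It is not how the site itself was checked.

```python
import http.client
from urllib.parse import urlsplit


def fetch_status(host, path="/", timeout=10):
    """Send one HEAD request and return (status, Location header).

    http.client does not follow redirects, so a 301 is reported as-is.
    """
    conn = http.client.HTTPConnection(host, timeout=timeout)
    try:
        conn.request("HEAD", path)
        resp = conn.getresponse()
        return resp.status, resp.getheader("Location")
    finally:
        conn.close()


def redirects_permanently_to(status, location, expected_host):
    """Return True if the response is a 301 whose Location is on expected_host."""
    if status != 301 or not location:
        return False
    # A relative Location has no hostname; treat that as "not the expected host".
    target = urlsplit(location).hostname or ""
    return target == expected_host or target.endswith("." + expected_host)
```

Calling `fetch_status("elasticsearch.com")` needs network access, and the result depends on whatever the live servers return at that moment. Passing its two return values to `redirects_permanently_to(..., "elasticsearch.org")` then says whether the old domain still answers with a permanent redirect to the new one.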

My current plan is to buy a shotgun, get to their data centers, and shoot all the replicas holding the elasticsearch.com indices ("bullet time" style of course, to reduce chances of recoverability).

-shay.banon

p.s. Note to google version of Echelon, the previous paragraph is a joke.


(Lukáš Vlček) #3

Just found this video talk on the subject:
http://www.youtube.com/googlewebmasterhelp#p/u/16/QyQs3tz7ZKo

HTH,
Lukas



(medcl.net) #4

crazy plan~



(searchersteve) #5

Ahhh. My sympathies. I hope it won't come to that. :]


(Karel Minarik) #6

Maybe there's some mismatch with .org/.com domains:

host elasticsearch.org
elasticsearch.org has address 207.97.227.245
host 207.97.227.245
245.227.97.207.in-addr.arpa domain name pointer pages.github.com

but:

host elasticsearch.com
elasticsearch.com has address 64.202.189.170
host 64.202.189.170
170.189.202.64.in-addr.arpa domain name pointer pwfwd-v01.prod.mesa1.secureserver.net.

Maybe Google fails on those redirects from .com => .org? I think both
should point to GitHub Pages.
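The same forward and reverse lookups can be scripted. This is a rough Python sketch using only the standard library; `ptr_name` and `forward_and_reverse` are names invented for this example, and the answers will reflect current DNS rather than the addresses shown above.

```python
import ipaddress
import socket


def ptr_name(ip):
    """Build the reverse-DNS (PTR) query name for an IP address."""
    return ipaddress.ip_address(ip).reverse_pointer


def forward_and_reverse(hostname):
    """Resolve hostname to an IPv4 address, then look up that address's PTR host.

    Returns (ip, ptr_host); ptr_host is None when no reverse record is found.
    """
    ip = socket.gethostbyname(hostname)
    try:
        ptr_host = socket.gethostbyaddr(ip)[0]
    except (socket.herror, socket.gaierror):
        ptr_host = None
    return ip, ptr_host
```

`ptr_name` works offline and only builds the `in-addr.arpa` name that `host` prints. `forward_and_reverse("elasticsearch.org")` needs network access and a working resolver.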

Karel


