Viernes, 18 Abril 2014 20:56
beggarly

Problems with Google indexing XML sitemap

An open forum for opinions and general questions

Problems with Google indexing XML sitemap

Postby mashut » Mon Sep 23, 2013 6:42 am

Hi! I have a multilingual site with the default English language. The problem is that the English sitemap which is generated by the component is indexed with /en/ directory by Google, although the /en/ suffix is not seen in the sitemap itself:
http://www.afstyle.eu/index.php?option=com_xmap&view=xml&id=7

First I thought that this is a Google's cache point, but then I created a new map with a new URL and still Google has indexed it with /en/. It is really a problem as the web site does not use this suffix and thus it is blocked in robots.

So I wonder if it is a Google's problem or the component that still somehow uses the directory /en/ when generating the map. Thanks in advance for any help.
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Re: Problems with Google indexing XML sitemap

Postby guilleva » Fri Sep 27, 2013 3:22 am

Hi, no, that doesn't seems to be a problem with Xmap, as you said, the sitemap isn't including de prefix... I cannot tell why Google is accessing it, just make sure that you submitted it correctly in Google's webmasters dashboard.
User avatar
guilleva
Administrator
Administrator
 
Posts: 1527
Joined: Wed Sep 12, 2007 3:10 am
Location: San José, Costa Rica

Re: Problems with Google indexing XML sitemap

Postby mashut » Fri Sep 27, 2013 12:18 pm

Thank you for your answer. I've discussed the problem on Google's forum:
https://productforums.google.com/forum/#!category-topic/webmasters/crawling-indexing--ranking/a8gQoZD3Fjg

I have to say that it is not my fault (the sitemap was added correctly to the Google WMT), but the bug of Xmap and Joomla. The problem is that Joomla 2.5 has a possibility of removing language alias from the default language URLs (language filter). There's something in the code of Xmap that still allows Google to fetch the default language alias that was removed by the system.

So I've manually created a new sitemap, copied the contents of the automatic Xmap and Google added it without any problems or errors. This means that the problem is with the Xmap code.

I think you should think over it as an important bug.
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Re: Problems with Google indexing XML sitemap

Postby guilleva » Fri Sep 27, 2013 2:12 pm

hi, thanks for letting me know about that, but I don't see Xmap adding the language alias to any url on this sitemap:

http://www.afstyle.eu/index.php?option=com_xmap&view=xml&id=7

that sitemap is being generated by Xmap, right?
User avatar
guilleva
Administrator
Administrator
 
Posts: 1527
Joined: Wed Sep 12, 2007 3:10 am
Location: San José, Costa Rica

Re: Problems with Google indexing XML sitemap

Postby mashut » Fri Sep 27, 2013 2:22 pm

Yes, the sitemap is generated by Xmap. I also cannot see the language alias in the map, but the problem is that Google sees it! I am not a programmer and cannot tell you why this happens. But I've created 3 site maps by Xmap (with different ids) and in each one Google finds the alias /en/. When I create a sitemap manually copying the contents from Xmap, Google cannot see the alias and everything is OK:
http://www.afstyle.eu/sitemap.xml (manually created sitemap)
So I think the problem is with Xmap code and Joomla language filter...
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Re: Problems with Google indexing XML sitemap

Postby guilleva » Mon Sep 30, 2013 3:16 am

mashut wrote:When I create a sitemap manually copying the contents from Xmap, Google cannot see the alias and everything is OK...


What do you exactly means with "Google cannot see the alias"? I don't understand... If the alias is not on the sitemap... then you, me or Google just won't see it.So, I don't know what I can say, if the problem is that the site map is accessible using the alias (at /en/index.php...), then there is anything Xmap can do, it's a problem with your site's configuration and you should prevent your site being accessible using that alias on any page, not just Xmap.

So, I'm not understanding correctly, please tell me what do your exactly means with google seeing or not seeing the alias... How do you know what is Google seeing?
User avatar
guilleva
Administrator
Administrator
 
Posts: 1527
Joined: Wed Sep 12, 2007 3:10 am
Location: San José, Costa Rica

Re: Problems with Google indexing XML sitemap

Postby mashut » Mon Sep 30, 2013 5:45 am

In WMT there is a tool "Fetch as Google". When I add the sitemap there, I can instantly see the URL as Google. And I see that all the URLs in the sitemap has /en/ (though we cannot see them in the code). I do not know, where Google finds them (I've tried to create 3 sitemaps with diffrernt ids and all the same Google fetched each with /en/), but I suppose it's a bug in Joomla or in Xmap.

PS I cannot do a redirect from /en/ to the main directory as the language filter does not change the languages correctly in that case...

If you think I'm crazy you can try to add your cart yourself to WMT section "Fetch as Google" (SEF on, language filter on, default language alias is off).
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Re: Problems with Google indexing XML sitemap

Postby b2un0 » Mon Sep 30, 2013 8:51 am

[English]

The language attribute musst set in the menu item, not in xmap.

what are the links? can you post example links who you add in WMT?

[German]

Der Sprachparameter für die Sitemap kommt auch nicht über die Sitemap, sondern über die Menüpunkte.

Kannst du die Sitemap Links posten die du auch in den WMT eingetragen hast?
z-index development
User avatar
b2un0
Junior Boarder
Junior Boarder
 
Posts: 22
Joined: Mon Jul 08, 2013 9:46 pm
Location: Kiel, Germany

Re: Problems with Google indexing XML sitemap

Postby mashut » Mon Sep 30, 2013 9:08 am

I have already placed here the link to the sitemap I tried to add to WMT:
http://www.afstyle.eu/index.php?option=com_xmap&view=xml&id=7
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Re: Problems with Google indexing XML sitemap

Postby mashut » Mon Sep 30, 2013 9:18 am

Here is the proof that Google fetches all the Urls in my sitemap with /en/:
http://www.afstyle.eu/sitemap

Here is the cache:
http://webcache.googleusercontent.com/search?q=cache:x7WEShhSp0oJ:www.afstyle.eu/sitemap+&cd=1&hl=ru&ct=clnk

The problem is that these URLS have never had /en/ in their URLs, it is not the problem of Google cache!!!
mashut
Fresh Boarder
Fresh Boarder
 
Posts: 6
Joined: Mon Sep 23, 2013 6:34 am

Next

Return to General



Who is online

Users browsing this forum: Exabot [Bot] and 3 guests