RSS import generates duplicates

Kevin Gaudin

Monday 05 October 2009 2:19:24 am

I'm running a site on v3.8.6 (migration to v4.0 is planned for November).

Is there a reason why RSS Import would duplicate some entries from this feed?
http://www.toulouse.fr/web/guest/actualite/lire/-/journal/rss/10128/ACTUALITES

All duplicated entries have a guid formed this way:

    <item>
      <title>Mobilisation pour un référendum sur le service public postal</title>
      <link>http://www.toulouse.fr/web/guest/actualite/lire/-/journal_content/56_INSTANCE_3iH6/10128/1436279</link>
      <description />
      <pubDate>Fri, 02 Oct 2009 13:05:00 GMT</pubDate>
      <guid>http://www.toulouse.fr/web/guest/actualite/lire/-/journal_content/56_INSTANCE_3iH6/10128/1436279</guid>
      <dc:creator>Léonard Lemince</dc:creator>
      <dc:date>2009-10-02T13:05:00Z</dc:date>
    </item>

The entries that are not duplicated have a guid formed this way:

    <item>
      <title>Le site de la semaine!</title>
      <link>http://www.toulouse.fr/web/guest/economie/innovation-technologies-de-l-information</link>
      <description />
      <pubDate>Thu, 01 Oct 2009 13:51:00 GMT</pubDate>
      <guid>http://www.toulouse.fr/web/guest/economie/innovation-technologies-de-l-information</guid>
      <dc:creator>Djet 27</dc:creator>
      <dc:date>2009-10-01T13:51:00Z</dc:date>
    </item>

Any hint is welcome. I did not find anything about this on issues.ez.no.

Thanks!

Twitter: @kevingaudin

Kevin Gaudin

Monday 05 October 2009 2:26:01 am

For each duplicate entry, eZ seems to add a different "jsessionid" to the link:

First entry:
http://www.toulouse.fr/web/guest/actualite/lire/-/journal_content/56_INSTANCE_3iH6/10128/1436279;jsessionid=1E0169E94E9694BD6D0AA2E15A2F46B0

Duplicate:
http://www.toulouse.fr/web/guest/actualite/lire/-/journal_content/56_INSTANCE_3iH6/10128/1436279;jsessionid=300C982D2BD28725BC61A1396C801A9A

This jsessionid does not appear anywhere in the original feed!

Kevin Gaudin

Monday 05 October 2009 5:33:13 am

Well, after further investigation, it looks like a bug in Liferay (the generator of the RSS feed), which adds these jsessionid values to the links when the feed is requested by a cookieless client.
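
For anyone hitting the same thing, one possible workaround on the importing side is to strip the ";jsessionid=..." part before comparing links. The snippet below is only a rough sketch in Python, not eZ Publish's RSS import code: the helper name strip_jsessionid is hypothetical, and it assumes the session token always appears as a ";jsessionid=" path parameter as in the two links above.

```python
import re

# Matches a ";jsessionid=<token>" path parameter. The token is assumed to end
# at the next ";", "?", "#", or at the end of the string.
_JSESSIONID = re.compile(r";jsessionid=[^;?#]*", re.IGNORECASE)


def strip_jsessionid(url: str) -> str:
    """Return the URL with any ";jsessionid=..." path parameter removed.

    Hypothetical helper for illustration; not part of eZ Publish or Liferay.
    """
    return _JSESSIONID.sub("", url)


base = (
    "http://www.toulouse.fr/web/guest/actualite/lire/-/journal_content/"
    "56_INSTANCE_3iH6/10128/1436279"
)
first = base + ";jsessionid=1E0169E94E9694BD6D0AA2E15A2F46B0"
duplicate = base + ";jsessionid=300C982D2BD28725BC61A1396C801A9A"

# Once the session token is removed, both links compare equal,
# so the second one can be recognised as already imported.
assert strip_jsessionid(first) == strip_jsessionid(duplicate) == base
```

Links without a jsessionid pass through unchanged, so the entries that were not duplicated would not be affected.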