+1-888-365-2779
Try Now
More in this section

Forums / General Discussions / Page Index not working properly

Page Index not working properly

9 posts, 0 answered
  1. shraddha
    shraddha avatar
    87 posts
    Registered:
    18 May 2009
    23 Jun 2010
    Link to this post
    Hi,
    I am using sitefinity search functionality. When I create news index, search gives proper results for that. But when I create page index for any section, appx. only 2 words per page are getting index and I am not getting results for any of the word(s) in page, not even for the title of the page. What can be the issue??


    --Thanx
    Shraddha
  2. Ivan Dimitrov
    Ivan Dimitrov avatar
    16072 posts
    Registered:
    25 Nov 2016
    23 Jun 2010
    Link to this post
    Hi shraddha,

    There could be some control/logic on your master page or on the pages you are indexing and this control breaks the crawler.

    You could also try the following

    1. Crate a new master page without any css, js and controls on it. Add just one ContentPlaceHolder.
    2. Create a new page and set the page to use the newly created template with a single ContentPlaceHolder
    3. Drop a GenericContent control on this page and type a simple text in it - say "Telerik".
    4. Save the page and make sure that it is with status "Published".
    5. Run the search index.
    6. Go to the page where you have already dropped SearchResults and SearchBox control and look up for "Telerik"

    The crawler does not index pages when

    1. Page is not added to the index - some pages are explicitly excluded when you index from a certain node
    2. There is some code/logic that breaks the crawler - for instance you use session somewhere or you check Identity.IsAuthenticated. In addition to the steps 1- 6 above you could clear the error log under App_Data/sitefinity.log, then run the index and see whether there are any errors thrown.

    3. fieldsInfoProvider.xml has been modified and weight is set to -1 for some of the holders.



    Kind regards,
    Ivan Dimitrov
    the Telerik team
    Do you want to have your say when we set our development plans? Do you want to know when a feature you care about is added or when a bug fixed? Explore the Telerik Public Issue Tracking system and vote to affect the priority of the items
  3. shraddha
    shraddha avatar
    87 posts
    Registered:
    18 May 2009
    24 Jun 2010
    Link to this post
    Hi,
    Thanx for the reply.
    Yes, there was session used in one of my common control which was used all over the website. When I removed that,  page index worked properly.
    But what is the solution if I have to use session???


    --Thanx
    Shraddha
  4. Ivan Dimitrov
    Ivan Dimitrov avatar
    16072 posts
    Registered:
    25 Nov 2016
    24 Jun 2010
    Link to this post
    Hello shraddha,

    Try to add the logic or methods which calls the session objects inside the condition below

    if  (!CmsContext.IsRequestCrawler(this.Context))
     {
         // this is not a crawler request - here execute your logic which involves session objects
     
      }


    All the best,
    Ivan Dimitrov
    the Telerik team
    Do you want to have your say when we set our development plans? Do you want to know when a feature you care about is added or when a bug fixed? Explore the Telerik Public Issue Tracking system and vote to affect the priority of the items
  5. shraddha
    shraddha avatar
    87 posts
    Registered:
    18 May 2009
    24 Jun 2010
    Link to this post
    Hi,
    It worked.. Thanx a lot.....


    --Shraddha
  6. Richard Baugh
    Richard Baugh avatar
    201 posts
    Registered:
    22 Aug 2012
    29 Jun 2010
    Link to this post
    I've got a slightly similar problem. The pages are being indexed, but the content is not. I have checked the index and no data is showing up in the content section while the guid and page sections have data. The page section shows 29 pages. This is correct. But there is no data in the content section.

    I have tried deleting the whole index folder and then re-indexing. Still no luck. I don't know of any sessions that would be causing any problems. I turned Healthmonitoring on and nothing was added to the log file.

    What html elements or css styles would cause this to not function? Wouldn't it just read the text from say a view source perspective?
  7. Ivan Dimitrov
    Ivan Dimitrov avatar
    16072 posts
    Registered:
    25 Nov 2016
    29 Jun 2010
    Link to this post
    Hello Richard Baugh,

    The page index gets rendered html. Have you tried the suggestion with creating a sample page without any css, js etc and using Generic Content control with a simple test?

    Regards,
    Ivan Dimitrov
    the Telerik team
    Do you want to have your say when we set our development plans? Do you want to know when a feature you care about is added or when a bug fixed? Explore the Telerik Public Issue Tracking system and vote to affect the priority of the items
  8. Richard Baugh
    Richard Baugh avatar
    201 posts
    Registered:
    22 Aug 2012
    29 Jun 2010
    Link to this post
    I just created the master page with no css and no js. Added content to the page and re-indexed just to make sure. Still no luck. The content section of the index still has no data. The page count increased as it should.

    Funny thing is that the admin page for the index shows that there are now 62 words indexed where before it only had 60. I assume it is getting the title and meta values although these are not showing up in the index files either.

    Btw, I am using the luke application to view the index.
  9. Ivan Dimitrov
    Ivan Dimitrov avatar
    16072 posts
    Registered:
    25 Nov 2016
    30 Jun 2010
    Link to this post
    Hi Richard Baugh,

    By default the page index includes  page title, keywords" and description". The parser that is used is for text/html content. Scripts and styles are excluded. The crawler uses HttpContext.Current when you perform indexing. If the HttpContext.Current or you are altering it by a custom handler/HttpModule there could be an issue. The index gets the HTML content by using ASP.NET StringWriter class.
    If the problem persists  you could open  a support request and attach a sample project that I can debug locally and see what the problem could be.

    Kind regards,
    Ivan Dimitrov
    the Telerik team
    Do you want to have your say when we set our development plans? Do you want to know when a feature you care about is added or when a bug fixed? Explore the Telerik Public Issue Tracking system and vote to affect the priority of the items
Register for webinar
9 posts, 0 answered