Davy Van Den Bremt: Three things we learned from indexing a Drupal site with millions of nodes in Apache SOLR

http://www.drupalcoder.com

For one of our clients, we are running a Drupal site with millions of nodes. Before launch, those nodes are imported from another database and then indexed into Apache SOLR. Indexing all of these nodes into an empty SOLR instance takes days rather than hours or minutes.
That is a bit too long to run this import regularly. So my (Xdebug) profiler and I delved into the Apache SOLR module code to see where we could shave a few hours or days off the execution time.
It turned out that, in our case, three components were responsible for a large share of the execution time. Let's have a look.
By the way, we are using the latest dev build of version 2 of the Apache SOLR module.