Parallelization of the import script

Thomas Sødring tsodring at oslomet.no
Mon Aug 23 17:55:54 CEST 2021


On 8/23/21 3:55 PM, Gareth Western wrote:
>
> I’m making some progress with testing the import of this large
> extraction (uttrekk). The next issue I’d like to look at is performance.
> Serial upload is too slow when the archive contains hundreds of
> thousands of documents. A quick analysis of the structure indicates
> that there are approximately 400,000 documents and 100,000 folders
> (mappe). Would it be safe to parallelize at this level? Or should I try
> at a higher level first (e.g. klasse, of which there are approximately
> 4000)?
>
>
Great news! We really appreciate the real-world testing! nikita should
not have any scalability issues with the database. The numbers you
mention are not that large, so I hope nikita handles them nicely. It
certainly makes sense to first try importing the 4000 klasse. I assume
you are not importing anything underneath them (e.g. saksmappe). Such an
import should be quick.

If you have several arkivdel in your extraction, you can try to limit the
import at the arkivdel level. Or you could limit it at the klasse level
just to get an idea of performance.
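
As a rough illustration of splitting the work at the klasse level, here is
a minimal Python sketch. It is only an assumption about how such a test
could be structured: import_klasse and import_in_parallel are made-up names,
not part of nikita or the import script, and the placeholder does no real
upload. The idea is that each task would cover one klasse, so the number of
parallel tasks stays at about 4000 rather than hundreds of thousands.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def import_klasse(klasse_id):
    # Hypothetical placeholder: a real version would upload this klasse
    # (and whatever is chosen beneath it) to nikita.
    return klasse_id


def import_in_parallel(klasse_ids, workers=4, import_one=import_klasse):
    # Run one task per klasse on a small thread pool and time the whole run.
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map returns results in the same order as klasse_ids.
        done = list(pool.map(import_one, klasse_ids))
    elapsed = time.perf_counter() - start
    return done, elapsed


if __name__ == "__main__":
    done, elapsed = import_in_parallel(range(4000))
    print(f"imported {len(done)} klasse in {elapsed:.2f}s")
```

Varying workers (1, 2, 4, 8, ...) and comparing the elapsed time would give
the kind of performance numbers that are useful to share.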

We would be very happy to see performance results here on the list. Good 
or bad :)

  - Thomas


