Idea for work flow to handling importing of documents in bulk
Petter Reinholdtsen
pere at hungry.com
Mon Mar 20 22:26:36 CET 2017
I got this idea for a workflow for uploading documents in bulk, but am
unsure will work with Noark 5, and hope you can comment on it.
I would like to have a way to upload documents / emails to the archive
in bulk / non-interactively, and update the metadata later. My idea is
to have a way to automatically upload documents (say allow a mailing
list to store a copy of posted messages in the archive), import lots of
PDFs from a directory or set up a "print queue" that feed the
archive. The required metadata (title, to, from, file, etc) would be
automatically or manually added after the document is stored in the
archive. This would allow the archive system to guess as much metadata
as possible from the document itself. As far as I can tell, the Noark 5
web API is not made to allow this, and intuitive use of the API will
require metadata to be inserted before the dokument is uploaded.
But if I understand correctly, it is possible to have a noark 5 file
(mappe) in the archive which is not listed in the public journal. Is
this correct? Also, if I understand correctly, it is possible to move a
document from one file to another (I assume it would involve moving the
mappe -> dokumentbeskrivelse connection). Is this correct?
If so, each 'user' of the archiving system can have a 'temporary' file,
in which every automatically uploaded document is attached. The user
can then go through every document in her 'temporary' file, update the
document metadata and assign it to the appropriate file. This will give
the document the correct case ID and sequence number, and make it show
up in the public mail journal. It would be best if most or all the
metadata could be modified until the document is moved into a
'permanent' file, in case incorrect information is guessed or extracted
automatically.
Would such work flow work with Noark 5?
Which operations is needed to make a file that will not show up in the
public mail journal? What about moving a document from one file to
another?
--
Happy hacking
Petter Reinholdtsen
More information about the nikita-noark
mailing list