Hello, and happy new year!
I believe Torstein Dybdahl was in contact with you earlier regarding the Norwegian languages and your corpus-building web crawler, URL:http://borel.slu.edu/crubadan/. He has since become busy with real life and has been unable to continue this effort. I am part of the group working with Torstein on the Norwegian spell checking systems. I'm very pleased to discover that Norwegian Nynorsk (nn) and Northern Saami (se) are listed on the status page with lots of files and words registered.
We here in Norway are now in the process of revitalizing the Norwegian Bokmål and Nynorsk spell checking package, URL:http://no.speling.org/. This is a volunteer project. To do a good job with this, we need up-to-date frequency information for Norwegian words. At the moment, we have access to neither a corpus nor frequency information for either of these languages.
In addition, a related group of people is funded by the Norwegian government to create spell checking systems for several of the Saami languages, URL:http://divvun.no/english.html. This work is organized by the University of Tromsø. This group has access to a corpus, but could use more words.
If I understood Torstein correctly, you are willing to share your collection of words with us. However, I have checked your web pages and been unable to find any links to the word collection. Where can I find the list of words, preferably with frequency information? Is the collection of files available on the web somewhere?
Norwegian Bokmål is missing from your status page. Would you be willing to collect documents for that language as well? If not, how hard would it be to set up your software on our servers so that we can collect words for this language ourselves?
I am sending a Cc to the Norwegian and Saami translators mailing list, which is read by the people working on the spell checking systems.
Friendly regards,