Overleg index:Het Koninkrijk Deel 01 Voorspel (1969).djvu
Hier gaat iets fout
bewerkenVanaf pag. 14 gaat er iets helemaal fout. De OCR loopt niet meer gelijk met de scans. Ik stop hier even mee. Eens kijken hoe dit moet worden opgelost. --Dick Bos (overleg) 3 feb 2016 21:45 (CET)
Request for Help
bewerkenthe following message is posted in the Scriptorium of en-ws:
Request for help with a problem on Dutch Wikisource.
Please help me with a problem on Dutch Wikisource. It's a problem with OCR-layer of the text of this book: s:nl:Index:Het_Koninkrijk_Deel_01_Voorspel_(1969).djvu
The pdf is on Commons: commons:File:L._de_Jong_-_Het_Koninkrijk_der_Nederlanden_in_de_Tweede_Wereldoorlog_1939-1945_Deel_1_Voorspel.pdf
This pdf was uploaded to IA (cf. this procedure).
The djvu thus created was uploaded to commons again: commons:File:Het_Koninkrijk_Deel_01_Voorspel_(1969).djvu
This djvu is used in Dutch Wikisource.
The problem is this: from (djvu-)page 25 onward (page 14) there is a page missing in ocr: so the ocr is not corresponding to the scanned image, but to the next page. Further up there are more pages missing. I don't exactly know which ones. I suppose a total of 24 pages is missing: djvu page 744 is the last one with an ocr. After that there are 24 more scanned pages in the book.
Can anyone explain what's gone wrong here? And how could I solve this problem?
After the moment I discovered this problem, I uploaded a new version (directly from NIOD) to IA. And again to Commons. And made this s:nl:Index:De_Jong_-_Koninkrijk_Deel_01_Voorspel_(1969).djvu. This file shows exactly the same problem. So it looks like there's something corrupt in the original pdf.