Discussing the Quality of Scans and OCR Text
Hi everyone,
I want to start a conversation about the quality of scans and OCR text in the British Newspaper Archive. Personally, I’ve noticed that the clarity of scans can vary a lot depending on the original paper, the condition of the page, and even the printing style of the time.
A few questions I’d love to hear your thoughts on
1. How do you work around OCR errors when doing searches? Do you use wildcards, or just read through the scans manually?
2. Have you noticed certain decades or newspapers that tend to scan particularly well (or badly)?
3. Do you feel the image quality has improved over time as BNA adds more material, or is it still hit-and-miss?
Looking forward to hearing how others approach this challenge!
Geometry Dash