Discussing the Quality of Scans and OCR Text

Hi everyone,

I want to start a conversation about the quality of scans and OCR text in the British Newspaper Archive. Personally, I’ve noticed that the clarity of scans can vary a lot depending on the original paper, the condition of the page, and even the printing style of the time.

A few questions I’d love to hear your thoughts on
1. How do you work around OCR errors when doing searches? Do you use wildcards, or just read through the scans manually?
2. Have you noticed certain decades or newspapers that tend to scan particularly well (or badly)?
3. Do you feel the image quality has improved over time as BNA adds more material, or is it still hit-and-miss?

Looking forward to hearing how others approach this challenge!
Geometry Dash

1 vote

Cooke Willis shared this idea · Sep 18, 2025 · Report… · Admin →

An error occurred while saving the comment

Give feedback

Knowledge Base

How can we improve The British Newspaper Archive?

Discussing the Quality of Scans and OCR Text

Feedback

Website improvements: feedback

Feedback and Knowledge Base

Searching…

Give feedback

Knowledge Base

The British Newspaper Archive

Discussing the Quality of Scans and OCR Text

We're glad you're here

We're glad you're here

We're glad you're here

We're glad you're here

Website improvements: feedback

Categories

Searching…

Give feedback

Knowledge Base

The British Newspaper Archive