J
My feedback
17 results found
-
59 votes
We have developed a prototype data feed from the digitisation workflow software which will provide much more granular data in future. Now we are looking at ways too feed this data through to the website and make it presentable
An error occurred while saving the comment -
80 votes
We’ve now done a thorough review of NewsVault and will be taking some of the better design features into our product roadmap.
An error occurred while saving the comment J commentedPlanned in 2015? It is now 8 years later and the layout is still awfully restrictive.
-
1 voteJ shared this idea ·
-
35 votes
An error occurred while saving the comment J commentedAlso, the search should be able to search for what was in the original citation. For example, a newspaper that shows under the comment as "Published: 4th Feb 1924" should at least let me search for 1924 and find it.
BUT I have to actually put 1924 into my comment as I save the bookmark for it to be searchable. Madness. -
6 votesJ supported this idea ·
-
132 votes
An error occurred while saving the comment J commentedDear ADMIN
If this was supposed to be improving since 2014, why is there no evidence? Is it because BNA are rushing into NEW digitisations, when there is such a huge amount of unsearchable information out there? It is pot luck whether the image scan and the OCR at the time the page was digitised are even remotely related to what we as humans can read.
Or even that the same image in Google can transcribe.
For goodness sake, they can now read information in dead languages on papyrus rolls that have been incinerated in the pyroclastic flows from the Vesuvius destruction of Pompeii and Herculaneum!An error occurred while saving the comment J commentednot only that, but even the FREE google text recognition in google keep on my phone is better!
An error occurred while saving the comment J commentedIt is.
Can I suggest you add your votes to this post which has a lot of votes already - maybe if it grows enough some notice will be taken.
https://help-and-advice.britishnewspaperarchive.co.uk/forums/243685-search-improvements/suggestions/42821181-replace-the-ocr-engineAn error occurred while saving the comment J commentedThe font is no excuse.
I've just used Google keep and "extract text" using an image downloaded from the BNA site - the article that made me look here. About 99% success, compared to less than 10%. The BNA OCR attempt was utter garbage, with a multitude of non-normal characters such as
"cultivator y® •®f honours. "
"continued to . ~ * • -ijii rr*, live "
"of the dung, and PJ trick : ft . r * v ett - was fc *• , . .. . , , , , Westminster School,"Google Keep gave spelling errors, and wrong letters but not this waste of space.
The BNA also has utterly strange ways of selecting parts of the page to scan together, because in every article I have seen so far, the sections are a complete mix of different things. Mine is a farming report merged (BNA didn't spot the column divider!) with the Death of a Marquis, plus the Intelligence from Cambridge University. an Arrest Report intermingled with both bits.
How on earth dare the BNA suggest that their OCR is accurate?
An error occurred while saving the comment J commentedThis is already a top suggestion - vote for it!
https://help-and-advice.britishnewspaperarchive.co.uk/forums/243685-search-improvements/suggestions/5953136-improve-optical-character-recognitionAn error occurred while saving the comment J commentedAbsolutely. Sadly I've now used my votes on similar suggestions.
J supported this idea ·An error occurred while saving the comment J commentedI've just tried it on the article that made me look here. About 95% success, compared to less than 10%. The BNA OCR attempt was utter garbage, with a multitude of non-normal characters such as
"cultivator y® •®f honours. "
"continued to . ~ * • -ijii rr*, live "
"of the dung, and PJ trick : ft . r * v ett - was fc *• , . .. . , , , , Westminster School,"Running a portion of the image as downloaded through Google Keep gave spelling errors, and wrong letters but not this waste of space.
The BNA also has utterly strange ways of selecting parts of the page to scan together, because in every article I have seen so far, the sections are a complete mix of different things. Mine is a farming report merged (BNA didn't spot the column divider!) with the Death of a Marquis, plus the Intelligence from Cambridge University. an Arrest Report intermingled with both bits.
How on earth dare the BNA suggest that their OCR is accurate?
-
32 votes
We will think about this.
An error occurred while saving the comment J commentedAbsolutely. I have spent hours correcting not just my own ancestors details but also names of those in the same article. Sometimes also nearby BMD.
-
8 votes
An error occurred while saving the comment J commentedthis post has the most votes so far as I can see
https://help-and-advice.britishnewspaperarchive.co.uk/forums/243685-search-improvements/suggestions/42821181-replace-the-ocr-engine -
3 votes
An error occurred while saving the comment J commentedYou're absolutely right that it is abysmal, but I suggest voting for this article which so far has the most votes already. https://help-and-advice.britishnewspaperarchive.co.uk/forums/243685-search-improvements/suggestions/42821181-replace-the-ocr-engine
-
1 vote
An error occurred while saving the comment J commentedThere is nothing in Huntingdonshire since 1899
-
9 votesJ supported this idea ·
-
1 voteJ shared this idea ·
-
23 votes
We’ll find out how hard this is to do
An error occurred while saving the comment J commentedAbsolutely. It seems ludicrous that the download gives NO IDEA of where you were looking. As it is, it would be better as an image that I could run through PROPER OCR and get a better transcription.
It would need to include the corrections that kind hearted people have made.The cheap scanner I have lets me save a scan as a searchable pdf. How hard can it be?
How can it take 6 years to review how hard this improvement might be?
-
64 votesJ supported this idea ·
An error occurred while saving the comment J commentedHow can this be under review for 6 years? Search is nearly as bad as the OCR. I use advanced and exact phrase, yet in the resulting images, every single word in that phrase is highlighted in the article individually.
-
18 votes
An error occurred while saving the comment J commentedThen when the page is highlighted to show your search phrases, each word in that exact phrase is shown individually. The search is on a par with that of Amazon, i.e. not fit for purpose.
-
8 votesJ supported this idea ·
-
37 votesJ supported this idea ·
An error occurred while saving the comment J commentedI cannot believe that I have been conned in to paying a subscription for abysmal OCR that is embarassing. Rather than keep adding new papers, it might serve them better to run their older papers through modern OCR software.
I've been a member for a week, and every single article i have found has errors, ranging from poor to abysmal. The reason I came to look here was a paper where the OCR hadn't recognised the columns, so merged two together, and within that hadn't recognised a tenth of the words. Yet there is nothing I can do to report it!
last responded in 2014 and still open-planned?