* No searchable text in PDF reports

Questions regarding use of any Version of Family Historian. Please ensure you have set your Version of Family Historian in your Profile. If your question fits in one of these subject-specific sub-forums, please ask it there.
User avatar
PeterR
Megastar
Posts: 1129
Joined: 10 Jul 2006 16:55
Family Historian: V7
Location: Northumberland, UK

No searchable text in PDF reports

Post by PeterR » 30 Dec 2020 16:23

I've noticed a problem with all the Report types I've used so far, and also a Book:
when saved as PDF, every page consists of a single graphic image, with no searchable text. This explains two problems mentioned elsewhere: much larger file sizes, and much longer times. Printing to a PDF printer is no better. Searchable PDF files (as in earlier FH versions) would be much more useful.

By contrast all the Diagrams I've so far saved to PDF still have searchable text (as do Query Results) which can be a useful feature.

Now reported to CP: Ticket #422085.
Last edited by PeterR on 30 Dec 2020 17:20, edited 1 time in total.
Peter Richmond (researching Richmond, Bulman, Martin, Driscoll, Baxter, Hall, Dales, Tyrer)

User avatar
tatewise
Megastar
Posts: 27078
Joined: 25 May 2010 11:00
Family Historian: V7
Location: Torbay, Devon, UK
Contact:

Re: No searchable text in PDF reports

Post by tatewise » 30 Dec 2020 16:38

I don't recall that anyone has spotted that significant change, which would be a backward step IMO.
So not only do PDF take much longer to produce, and create much larger files, but they are not searchable ~ assuming you can create them at all, and I cannot.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry

User avatar
Mark1834
Megastar
Posts: 2146
Joined: 27 Oct 2017 19:33
Family Historian: V7
Location: South Cheshire, UK

Re: No searchable text in PDF reports

Post by Mark1834 » 30 Dec 2020 16:45

Well done Peter, that explains what we have seen, and why text quality is so downgraded when downscaling images.

I'm not sure how to interpret that. I can't believe it was a deliberate move to take out that functionality (it means we can't copy text either), but neither can I credit that an established software house would make such a boo boo by accident...
Mark Draper

User avatar
AdrianBruce
Megastar
Posts: 1961
Joined: 09 Aug 2003 21:02
Family Historian: V7
Location: South Cheshire
Contact:

Re: No searchable text in PDF reports

Post by AdrianBruce » 30 Dec 2020 16:58

Definite thanks from me as well, Peter - even though I'm a Middle Ages v6 user!

I have absolutely no experience of direct use of .PDF creation routines so I have no idea whether that's a simple slip up or what...
Adrian

User avatar
LornaCraig
Megastar
Posts: 2989
Joined: 11 Jan 2005 17:36
Family Historian: V7
Location: Oxfordshire, UK

Re: No searchable text in PDF reports

Post by LornaCraig » 30 Dec 2020 17:02

I seem to recall that during beta testing someone picked up on the non-searchable text, and it was logged as a suggestion. I don't recall any indication of whether it was deliberate or not.
Lorna

User avatar
paultt
Famous
Posts: 114
Joined: 18 Jan 2005 21:59
Family Historian: V7
Location: Hampshire, England
Contact:

Re: No searchable text in PDF reports

Post by paultt » 30 Dec 2020 17:03

Ahhh! That explains why I cannot get any of the hyperlinks carefully constructed within the new editor to work from a generated pdf file. Never dreamt that the whole page was made into one image by the fh&nova7 combination!
Well spotted. I think I may have to consider going back to v6!

User avatar
Mark1834
Megastar
Posts: 2146
Joined: 27 Oct 2017 19:33
Family Historian: V7
Location: South Cheshire, UK

Re: No searchable text in PDF reports

Post by Mark1834 » 30 Dec 2020 17:19

I realise we are trying to second guess Calico Pie, but I've tried printing an existing 50 page Word document containing mixed text and graphics using the FH PDF printer. It's a larger file than Microsoft Print to PDF produces (10.7 vs 6.5 MB), but the text is still fully searchable text. We know the FH reports are still text when displayed, as they output properly to RTF, so where is the conversion occurring...?
Mark Draper

User avatar
tatewise
Megastar
Posts: 27078
Joined: 25 May 2010 11:00
Family Historian: V7
Location: Torbay, Devon, UK
Contact:

Re: No searchable text in PDF reports

Post by tatewise » 30 Dec 2020 17:44

The conversion is when FH formats the Report before sending it as an image per page to the Family Historian Nove PDF.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry

User avatar
dewilkinson
Superstar
Posts: 280
Joined: 04 Nov 2016 19:05
Family Historian: V7
Location: Oundle, Northamptonshire, England
Contact:

Re: No searchable text in PDF reports

Post by dewilkinson » 30 Dec 2020 18:54

To get a searchable pdf which is also a sensible size Save As rtf, load it into Word and Save As a pdf. I shall be using this route for the foreseeable future (tier 4 willing!)
David Wilkinson researching Bowtle, Butcher, Edwards, Gillingham, Overett, Ransome, Simpson, and Wilkinson in East Anglia

Deterioration is contagious, and places are destroyed or renovated by the spirit of the people who go to them

User avatar
tatewise
Megastar
Posts: 27078
Joined: 25 May 2010 11:00
Family Historian: V7
Location: Torbay, Devon, UK
Contact:

Re: No searchable text in PDF reports

Post by tatewise » 30 Dec 2020 20:46

But as has been said before, that route does not work well if media images are required in the Report.

It does seem inconceivable that Calico Pie have deliberately gone down this path.

One neat trick with actual text in the PDF file is that a URL becomes an active hyperlink, and that feature is now missing.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry

User avatar
dewilkinson
Superstar
Posts: 280
Joined: 04 Nov 2016 19:05
Family Historian: V7
Location: Oundle, Northamptonshire, England
Contact:

Re: No searchable text in PDF reports

Post by dewilkinson » 30 Dec 2020 20:57

Indeed but work arounds are never perfect, lesser of two evils? The only other option I can see is manipulating images in Word before generating the pdf, but for a large report that is going to be tedious if not impossible. I'm with you, I can't believe Calico Pie has done this deliberately. It has crossed my mind to look other packages but I shan't do anything immediately as I thought FH6 was the best available, but ...
David Wilkinson researching Bowtle, Butcher, Edwards, Gillingham, Overett, Ransome, Simpson, and Wilkinson in East Anglia

Deterioration is contagious, and places are destroyed or renovated by the spirit of the people who go to them

avatar
davepacey
Famous
Posts: 135
Joined: 22 Nov 2002 19:00
Family Historian: V7
Location: Lincolnshire, UK

Re: No searchable text in PDF reports

Post by davepacey » 30 Dec 2020 21:09

Until the reports are fixed, I have no intention of getting rid of V6.27 the reporting is far superior, its just the hassle of swapping between the two, but the plugin route works just fine for me.
Dave Pacey - Lincolnshire UK

User avatar
Andy63
Gold
Posts: 19
Joined: 23 Nov 2020 04:46
Family Historian: V7
Location: Québec, Canada

Re: No searchable text in PDF reports

Post by Andy63 » 01 Jan 2021 00:39

Personally, I think they rush it out for Christmas and the PDF part of the software was left unfinished. It's impossible they didn't knew about it.

User avatar
Mark1834
Megastar
Posts: 2146
Joined: 27 Oct 2017 19:33
Family Historian: V7
Location: South Cheshire, UK

Re: No searchable text in PDF reports

Post by Mark1834 » 01 Jan 2021 10:09

It's a difficult balance for software houses. Do they release a new version with known issues or wait until it is perfect but earn nothing from it? There is no right answer. A month ago, RootsMagic announced a public beta of version 8 by year-end, but it has not appeared. If Calico Pie were working to a firm year-end deadline, they had to make a judgement call on whether to leave this unfixed or not.

We have no way of knowing how many users this major flaw impacts - is it just a small vocal minority, or a significant number? The risk for CP is that magazine reviewers and bloggers pick it up, and reputations are lost much more quickly than they are gained.
Mark Draper

User avatar
Mark1834
Megastar
Posts: 2146
Joined: 27 Oct 2017 19:33
Family Historian: V7
Location: South Cheshire, UK

Re: No searchable text in PDF reports

Post by Mark1834 » 01 Jan 2021 11:01

A month ago, RootsMagic announced a public beta of version 8 by year-end, but it has not appeared.
Correction - it wasn't there yesterday afternoon (UK time), but is now available - just in time planning!
Mark Draper

User avatar
ColeValleyGirl
Megastar
Posts: 4853
Joined: 28 Dec 2005 22:02
Family Historian: V7
Location: Cirencester, Gloucestershire
Contact:

Re: No searchable text in PDF reports

Post by ColeValleyGirl » 01 Jan 2021 11:19

One software product I use entered beta testing for a new version on 20 November.

2017.

It has just updated its projected release date to 2021. I can only assume they're aiming for zero bugs, which is laudable but very annoying...

I think I prefer CP's approach -- PDF production isn't core functionality, after all, and there are workarounds (albeit inconvenient) for those who want to produce PDFs.

User avatar
PeterR
Megastar
Posts: 1129
Joined: 10 Jul 2006 16:55
Family Historian: V7
Location: Northumberland, UK

Re: No searchable text in PDF reports

Post by PeterR » 01 Jan 2021 12:02

PDF File Support has been part of FH functionality since at least version 5.0, almost 10 years ago, as the following quote from the Help topic shows (unchanged between FH5 and FH7):
Saving items in PDF file format is very convenient for a number of purposes - here are some:
It can be a useful way of sending charts, reports and books to other people - for example, as email attachments.
Good luck trying to send the bloated PDFs from FH7 by email!

Why have CP broken some functionality that already worked perfectly?
Peter Richmond (researching Richmond, Bulman, Martin, Driscoll, Baxter, Hall, Dales, Tyrer)

User avatar
AdrianBruce
Megastar
Posts: 1961
Joined: 09 Aug 2003 21:02
Family Historian: V7
Location: South Cheshire
Contact:

Re: No searchable text in PDF reports

Post by AdrianBruce » 01 Jan 2021 17:05

PeterR wrote:
01 Jan 2021 12:02
... Why have CP broken some functionality that already worked perfectly?
Why does any software company deliberately break functionality that already worked perfectly? They don't. Seriously, having worked for a software organisation, they don't. Not unless there are other issues elsewhere and it's a least worst option.

For most of us, there is a decent temporary work-round - saving the report to .RTF and PDFing it from our favourite word-processor. I wouldn't be surprised if that disrupts image positioning in some reports, that's always the first thing to go wonky in my experience, so it's not necessarily 100% effective.

I have no idea how to set Nova PDF (assuming that's what involved) to produce image-only PDFs - apparently it's as simple as a "Print as Image" checkbox in a dialog box in the full-fat Adobe Acrobat - so whether it's possible to accidentally configure an internal, unchangeable default for FH's routine, I've no idea. But that may be what's happened here. Just a complete guess from me.
Adrian

avatar
RS3100
Famous
Posts: 240
Joined: 05 Nov 2020 12:16
Family Historian: V7
Location: Hertfordshire, UK

Re: No searchable text in PDF reports

Post by RS3100 » 01 Jan 2021 17:14

I think that the formatting as an image is happening within FH, before the data is passed to the Nova PDF driver, since I've found that reports are printed as images in other PDF drivers that are installed on my PC, such as the CutePDF, Foxit and Quicken drivers.

I normally print to CutePDF and nothing has changed with that driver since I "upgraded" to FH7. It printed text searchable reports that I sent to it from FH6.2.

User avatar
PeterR
Megastar
Posts: 1129
Joined: 10 Jul 2006 16:55
Family Historian: V7
Location: Northumberland, UK

Re: No searchable text in PDF reports

Post by PeterR » 01 Jan 2021 17:24

I agree with RS3100. I have also tried a variety of PDF printers. The problem is not within NovaPDF. CP have accidentally or otherwise changed the way reports and books are output from FH for printing or saving as PDF. And I've tried the RTF route, but the printed layout is significantly inferior.
Peter Richmond (researching Richmond, Bulman, Martin, Driscoll, Baxter, Hall, Dales, Tyrer)

User avatar
AdrianBruce
Megastar
Posts: 1961
Joined: 09 Aug 2003 21:02
Family Historian: V7
Location: South Cheshire
Contact:

Re: No searchable text in PDF reports

Post by AdrianBruce » 01 Jan 2021 17:34

RS3100 wrote:
01 Jan 2021 17:14
I think that the formatting as an image is happening within FH, before the data is passed to the Nova PDF driver, ...
Hmm. It does sound like there's either a common API to all those PDF drivers, one that we can't access, that has accidentally been set up to imagize PDFs, or FH is always calling an imagize routine, when there was meant to be a dialog box somewhere in FH, Printer Set-up or whatever, that gave the option "Print As Image". Weird. Hopefully it is just a plumbing issue along those lines.
Adrian

User avatar
AdrianBruce
Megastar
Posts: 1961
Joined: 09 Aug 2003 21:02
Family Historian: V7
Location: South Cheshire
Contact:

Re: No searchable text in PDF reports

Post by AdrianBruce » 01 Jan 2021 17:36

PeterR wrote:
01 Jan 2021 17:24
... I've tried the RTF route, but the printed layout is significantly inferior.
Darn - I was afraid of that...
Adrian

User avatar
paultt
Famous
Posts: 114
Joined: 18 Jan 2005 21:59
Family Historian: V7
Location: Hampshire, England
Contact:

Re: No searchable text in PDF reports

Post by paultt » 01 Jan 2021 17:56

For me, the RTF file route and then to PDF does not work. As soon as we go to .rtf, it will be assumed that the output will be on paper, and I cannot get a link to click on paper :D
I tested with a small report which I had entered website hyperlinks into the notes. If I save it as an html page, the links work ok.
If I redirect the report to the standard Micro$oft PDF printer that comes with windows, I get a blank pdf file, which is useless.
If you DO manage to get an rtf file to load it into say Word or Wordpad, it will mean searching for ALL the text that was a hyperlink, and actually setting the wordage to a hyperlink within Word, and then sending it to a pdf generator, like the Micro$oft PDF printer!
There are so many things in v7 for which I have had to change my workflow, and they nearly all involve extra mouse clicks or tabs, or manual interventions, that FH is not an easy and friendly genealogy package anymore. Try copying some text say from a website or decent pdf, or even your email signature which has a link to your website, into one of the 'new' note fields using the rich text formatting. I have found that the paste actually messes up the formatting of the original, which is what I was hoping to keep. Lots of hidden gotchas, not just the pdf fiasco.

User avatar
fhtess65
Megastar
Posts: 525
Joined: 15 Feb 2018 21:34
Family Historian: V7
Location: British Columbia, Canada
Contact:

Re: No searchable text in PDF reports

Post by fhtess65 » 01 Jan 2021 19:36

I was able to make the export to .rtf and then publish to .pdf work, however, I had to use Papyrus Author. I tried using both Word and Atlantis, but neither couldn't read the table from FH7. I then tried in LibreOffice, and same result as Word and Atlantis. See attached...

Very interesting that only a very specialized piece of software like Papyrus Author (free version) was able to open and render the .rtf properly. Not many people would even have it. I will try it in Scrivener as well and see what happens.
Attachments
IndivSummaryReport_rtf_inLibreOffice.jpg
IndivSummaryReport_rtf_inLibreOffice.jpg (117.55 KiB) Viewed 5640 times
IndivSummaryReport_rtf_inWord.jpg
IndivSummaryReport_rtf_inWord.jpg (125.37 KiB) Viewed 5640 times
IndivSummaryReport_rtf_inAtlantis.jpg
IndivSummaryReport_rtf_inAtlantis.jpg (73.59 KiB) Viewed 5640 times
Last edited by fhtess65 on 02 Jan 2021 17:39, edited 1 time in total.
---
Teresa Basińska Eckford
Librarian & family historian
http://writingmypast.wordpress.com
Researching: Spong, Ferdinando, Taylor, Lawley, Sinkins, Montgomery; Basiński, Hilferding, Ratowski, Paszkiewicz

User avatar
AdrianBruce
Megastar
Posts: 1961
Joined: 09 Aug 2003 21:02
Family Historian: V7
Location: South Cheshire
Contact:

Re: No searchable text in PDF reports

Post by AdrianBruce » 01 Jan 2021 20:42

Hmmm. Thanks Teresa - looks like my idea of save as .RTF doesn't work with a lot of stuff, then.

I'm beginning to wonder if this transformation into an image is deliberate after all - if MS Word can't cope with some of the .RTF formatting - well, maybe NovaPDF has issues with whatever its text-based input format is as well? Hence rendering it as an image?

I have no idea about versions of RTF - whether NovaPDF and MS Word are behind in their versions of RTF (or whatever NovaPDF uses as input) - I guess that if Papyrus Author can cope, that may be so. Maybe CP can't find the correct RTF coding to render what they need in NovaPDF, so have rendered it as an image? More and more, it looks not nice to me ... Or maybe I'm thinking too much... :?
Adrian

Post Reply