I search for a software that compares texts. It seems easy... I explain my problem :
I have a word document with a certain text (see below for an example) and I want to compare it with a pdf file that should contain this text (typically a label that we paste on products). The goal is to know if the word text is the same as the one on label. But the sentences can be in different order...
Here an example :
I have a word document with :
H225 Highly flammable liquid and vapour.
H318 Causes serious eye damage.
Hazardous to the ozone layer.
And I have a PDF file with the text in 2 columns :
I don't want to check manually if the sentences are the same (it's not necessarily in English).
I tried Google vision for character recognition, but it doesn't compare. I tried also ABBY FineReader 14, but if the sentences aren't in the same order, it says that the text has been removed/added...