I’m struggling to extract specific values from a PDF file. I’m currently using HA PDF (GitHub - emcniece/ha_pdf: Home Assistant PDF File Sensor), which extracts some of the values I need, but not all of them. I test my expressions on regex101.com, where they all match, but in HA only some of them work and I don’t know why.
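To illustrate the kind of mismatch I mean, here is a minimal Python sketch. It assumes (I have not confirmed this) that the text HA extracts from the PDF has different whitespace than the text I paste into regex101. The sample strings, the patterns and the `first_group` helper are made up for illustration and are not taken from my file or from ha_pdf.

```python
import re

# Hypothetical sample: the value as it looks when copied from a PDF viewer
pasted_text = "Total amount: 123.45 EUR"

# Hypothetical sample: the same value as a PDF text extractor might return it,
# with a line break after the label and a non-breaking space before the unit
extracted_text = "Total amount:\n123.45\u00a0EUR"

# Pattern written against the pasted text: expects literal single spaces
strict = re.compile(r"Total amount: (\d+\.\d+) EUR")

# Pattern that accepts any whitespace (or none) between the parts
tolerant = re.compile(r"Total\s+amount:\s*(\d+\.\d+)\s*EUR")


def first_group(pattern, text):
    """Return the first capture group, or None if the pattern does not match."""
    match = pattern.search(text)
    return match.group(1) if match else None


print(first_group(strict, pasted_text))       # matches the pasted text
print(first_group(strict, extracted_text))    # no match on the extracted text
print(first_group(tolerant, extracted_text))  # matches despite the whitespace
```

If something like this is the cause, testing the expressions against the raw extracted text instead of a copy-paste from the viewer should show the difference.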
Another option I tried is converting the PDF to an XLSX file. That gives better results, but still not everything I need. There are free online PDF-to-XLSX converters that work perfectly; it would be great if this were possible in HA.
So, can someone help me with the following?
- Is there an online regex tester that works on PDF files?
- Is there a good converter / extractor (integration) to use with HA?
- Can I automate the online conversion through HA?
- Something else?