PDF extract data (solved)

I’m struggling to extract specific values from a pdf file. Currently I’m using HA pdf (GitHub - emcniece/ha_pdf: Home Assistant PDF File Sensor) and can extract some values from a pdf file, but not all values I need. I’m using regex101.com to test the expressions and they all work fine online, but not always in HA, some will work, some not and I don’t know why.
another option I tried is to convert the pdf to a xlsx file, I have better results but still not all I need. There are free online pdf to xlsx converters, these work perfect, would be great if this is possible in HA.
so can someone help me with:

  • pointing me to a online pdf regex tester?
  • is there a good converter / extractor (integration) to use with HA?
  • can I automate the online conversion through HA?
  • something else?

In case someone is having a similar problem, I found a solution.
I installed camelot on a rpi where I have Pihole running to convert pdf files coming from HA (I have HAOS running on another rpi) and sending it back as an excel file, and it’s working perfectly!