Hi,
I am developing the project SDAPS (http://sdaps.org). Right now the project reads in both the ODT file and PDF file, to extract metadata (from style names) and positions of text fields (by parsing the PDF file). I would like to extract this information directly from LibreOffice using python3-uno.
Ideally, one would develop a new way to describe these forms (it would be neat if it also was a PDF form). However, there are some constraints:
- Check-/Radioboxes should have a specific border width (exactly 1pt)
- The absolute position needs to be exported (one way might be by using XML layout export information)
- export the text belonging to a checkbox
- multicolumn layout/right to left languages should work
- include more metadata (i.e. assigning variables/values to checkboxes for choice/range questions)
In general, it would seem sane to use normal form elements. However, it does not seem possible to specify exactly how they should be drawn. Right now we are using empty text fields with a known size and border.
An example (document) can be found at:
- http://sdaps.org/Documentation/LibreOffice
- https://github.com/benzea/sdaps/blob/master/test/data/odt-3/debug.odt?raw=true
I’ll be happy to answer more questions about this. I would love to hear ideas about how this could be done nicely. An important aspect is of course to make it simple to use; the style based system right now is rather hard to use, and even worst to debug.