Headless conversion to HTML embeds images instead of creating separate files [closed]

asked 2015-10-02 16:49:11 +0100

jlentz

updated 2020-10-25 11:20:55 +0100

Alex Kemp

We've been using LibreOffice's headless conversion to convert Word documents to HTML files. In version 4.0, it created an HTML file plus a separate JPG file for each image embedded in the Word document, and referenced those files through the src attribute of the img tag.

Now that we've upgraded to 4.2, the conversion creates only the HTML file, with all of the images embedded inline as base64-encoded data in the src attributes (e.g.

Is there a way to make the LibreOffice conversion create individually linked image files again? Here's the command we are using for the conversion:

soffice --headless --convert-to html:HTML file_to_convert.docx
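
To tell which behaviour a given build produces, one option is to count the inline images in the generated HTML. This is only a rough sketch: count_embedded_images is a made-up helper name, and it assumes the embedded images appear as double-quoted src="data:image/..." attributes, which may not match every version's output.

```shell
# Hypothetical helper (not part of LibreOffice): count images that are
# embedded as base64 data URIs in an HTML file.
# Assumption: embedded images look like src="data:image/...".
count_embedded_images() {
    # grep -o prints every match on its own line, wc -l counts those lines,
    # and tr strips the padding that some wc implementations add.
    grep -o 'src="data:image/[^"]*' "$1" | wc -l | tr -d ' '
}
```

After a conversion, running count_embedded_images file_to_convert.html and getting 0 would suggest the images were written as separate files (or that the document had no images at all).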

Closed by Mike Kaganski for the following reason: question is not relevant or outdated
Close date: 2017-12-27 23:47:25

1 Answer

answered 2017-12-27 23:47:13 +0100

This was resolved in tdf#48887, and by default, the images are saved as separate files again. Optionally, one can force LibreOffice to embed the images, using this command line:

soffice --convert-to html:HTML:EmbedImages file_to_convert
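
If both variants are needed in a script, the two filter strings from this thread can be wrapped in a small function. Treat this as a sketch rather than an official recipe: convert_html, its embed/separate mode names, and the SOFFICE override variable are all invented here for illustration. Only --headless, --convert-to, html:HTML and html:HTML:EmbedImages come from the commands above, and whether html:HTML really writes separate files depends on running a version that includes the tdf#48887 fix.

```shell
# Hypothetical wrapper around the two command lines quoted in this thread.
# Usage: convert_html embed|separate FILE
# SOFFICE is an assumed override for the binary path (defaults to soffice).
convert_html() {
    mode=$1
    input=$2
    case "$mode" in
        embed)    filter='html:HTML:EmbedImages' ;;  # images inlined in the HTML
        separate) filter='html:HTML' ;;              # default: linked image files
        *)
            echo "usage: convert_html embed|separate FILE" >&2
            return 2
            ;;
    esac
    "${SOFFICE:-soffice}" --headless --convert-to "$filter" "$input"
}
```

For example, convert_html separate file_to_convert.docx runs the same command as in the question, while convert_html embed file_to_convert.docx asks for inline images.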

Closing as outdated.
