LibreOffice converts to HTML with default charset=iso-8859-1

asked 2017-02-09 08:10:29 +0200

rigmantas

updated 2017-02-09 08:24:19 +0200

Hi, maybe I am mistaken, but on 2017-01-31 the charset of the HTML that LibreOffice produces when converting from a terminal window changed from utf-8 to iso-8859-1. I tested on multiple virtual servers and cannot change iso-8859-1 back to utf-8. These are the commands I used:

unoconv --format html /0/bb.docx
unoconv --format html --export FilterOptions=UTF8,LF /0/aa.odt
unoconv -f html -i FilterOptions=UTF8 /0/aa.odt
soffice --headless --convert-to html --outdir /0 /0/bb.docx
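To compare the servers, the charset that each converted file declares can be read from its meta tag. Below is a minimal sketch in Python, not part of LibreOffice or unoconv; the function name `declared_charset` is hypothetical, and it only reports what the file claims, not how the bytes are actually encoded.

```python
import re

# Matches the charset named in either
#   <meta http-equiv="content-type" content="text/html; charset=X"/>
# or
#   <meta charset="X">
_CHARSET_RE = re.compile(
    rb'<meta[^>]*charset\s*=\s*["\']?([A-Za-z0-9_\-]+)',
    re.IGNORECASE,
)


def declared_charset(html_bytes):
    """Return the lowercased charset declared in the first matching meta tag, or None."""
    # The declaration sits in <head>, so scanning the first few KB is enough.
    match = _CHARSET_RE.search(html_bytes[:4096])
    return match.group(1).decode("ascii").lower() if match else None


# The meta tag from the converted file shown in this post.
sample = b'<meta http-equiv="content-type" content="text/html; charset=iso-8859-1"/>'
print(declared_charset(sample))  # prints: iso-8859-1
```

For a converted file, the bytes can be read with `open(path, "rb").read()` and passed to the function, where `path` is wherever the conversion wrote its output (assumed here, not confirmed by the commands above).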

This is the result of converting docx to html:

Ąčęėįšųū9žąČĘĖĮŠŲŽ -> Ąčęėįšųū9žąČĘĖĮŠŲŽ

This is the source:

<html>
<head>
<meta http-equiv="content-type" content="text/html; charset=iso-8859-1"/>
<title></title>
<meta name="generator" content="LibreOffice 5.0.6.2 (Linux)"/>
<meta name="author" content="Rimantas"/>
<meta name="created" content="2017-02-08T08:19:00"/>
<meta name="changedby" content="Rimantas"/>
<meta name="changed" content="2017-02-08T08:19:00"/>
<meta name="AppVersion" content="15.0000"/>
<meta name="DocSecurity" content="0"/>
<meta name="HyperlinksChanged" content="false"/>
<meta name="LinksUpToDate" content="false"/>
<meta name="ScaleCrop" content="false"/>
<meta name="ShareDoc" content="false"/>
<style type="text/css">
@page { size: 8.27in 11.69in; margin-left: 1.18in; margin-right: 0.39in; margin-top: 1.18in; margin-bottom: 0.79in }
p { margin-bottom: 0.1in; direction: ltr; line-height: 120%; text-align: left; orphans: 2; widows: 2 }
</style>
</head>
<body lang="lt-LT" dir="ltr">

Ąčęėįšųū9žąČĘĖĮŠŲŽ

</body>
</html>

Thanks and regards,
Rimantas
