Ask Your Question
0

Why are LO documents so large compared with MS Word? [closed]

asked 2014-01-27 01:09:21 +0200

Enkel gravatar image

updated 2015-08-27 23:30:56 +0200

Alex Kemp gravatar image

I note when I have written a document and saved it in both LO (.odt) and Word (.doc or .docx), the LO document is typically twice as large as the Word document. Why is this? I thought the open document format was supposed to be more compact that the Office format.

edit retag flag offensive reopen merge delete

Closed for the following reason the question is answered, right answer was accepted by Alex Kemp
close date 2016-02-19 04:24:19.853405

3 Answers

Sort by » oldest newest most voted
2

answered 2014-01-27 07:22:57 +0200

oweng gravatar image

Here are some statistics based on tests with a text file version of The Histories by Herodotus. The UTF-8 text file is here (download, rename from JPG to ZIP and extract the TXT file) for others to test various versions of different products in a comparable manner.

Test parameters

The text file has these dimensions:

  • File size (bytes): 1540211 (on an ext3 volume).
  • Characters (including spaces): 1535760.
  • Words: 270607.
  • Lines: 33494 (it contains many CRLF characters to try and replicate the page layout of the original work).

I have tested opening this text file using:

  • LibreOffice Writer v4.1.4.2 Build ID: 0a0440ccc0227ad9829de5f46be37cfb6edcf72
  • MS Office 2011 Word for Mac v14.3.9

... and saving to these formats:

  • ODF v1.2 Extended i.e., ODT.
  • MS Office 97/2000/XP/2003 (.doc) i.e., the old binary format.
  • MS Office 2007/2010 XML (.docx) i.e., OOXML.

I have also tested re-saving the various DOC and DOCX files back to ODT.

Results

For the versions tested and for the same process when using plain text, ODT produces a noticeably smaller file when compared with the DOC and DOCX equivalent. Note that while file sizes can be directly compared on a per-equivalent-format basis, (e.g., DOC as saved by LO with DOC as saved by MSO), it is inaccurate to the compare percentage gains across formats where the original files differ in size. For this reason the percentages shown in the tables are all in relation to the size of the original text file. Cross-comparisons are more easily made in this way.

table of results

For the DOC and DOCX formats LO produces a:

  • Larger DOC file than MSO 2011 e.g., ~3830000 vs 2616320 bytes / ~250% vs ~170% of the TXT file.
  • Smaller DOCX file than MSO 2011 e.g., ~696000 vs 1043855 bytes / ~45% vs ~68% of the TXT file.

Round trips, whether into different formats or the same format, does vary these figures. I have not bothered to show repeated saves into the originating format, with the exception of .doc (MSO created) -> .doc (LO saved) and .docx (MSO created) -> .docx (LO saved). The trend in file sizes when saving to non-native formats will tend to be: earlier versions of LO will create smaller files and later versions larger files. This is due to improved understandings of the underlying specification and improved implementation of corner cases, etc.

Why are there differences?

So ... (more)

edit flag offensive delete link more
0

answered 2014-01-27 01:28:57 +0200

Enkel gravatar image

Thanks for answering so promptly. However the link you gave doesn't actually address the problem I put. My documents are text only - no pictures or anything in them. If I have created my documents myself then depending on how I save them (odt or doc/docx) the file size is wildly different, odt being typically twice the size of doc/docx files. And finally the size is consistent, that is if I start with a doc file, edit it and resave as a doc file the size remains broadly the same. Similarly with odt file. If I start with an odt file and save as a doc/docx file the size shrinks and conversely if I start with a doc/docx file and save as odt the size bloats.

edit flag offensive delete link more

Comments

@Enkel, can I get you to cut and paste this content from this answer back into your question, as addition information? Thanks for clarifying. I will see if I can expand on my original answer (in the linked thread by @mariosv), but want to do some testing of this first.

oweng gravatar imageoweng ( 2014-01-27 01:54:41 +0200 )edit

@Enkel, please what are sizes on what are you talking?

m.a.riosv gravatar imagem.a.riosv ( 2014-01-27 03:26:25 +0200 )edit

Enkel, you have such a high rank, yet you don't have the smartness to NOT POST AN ANSWER TO A QUESTION OF YOURS THAT ISN'T AN ANSWER AT ALL !

rautamiekka gravatar imagerautamiekka ( 2015-06-01 16:08:10 +0200 )edit

Enkel, you have such a high rep, yet you don't have the smartness to NOT POST AN ANSWER TO A QUESTION OF YOURS THAT ISN'T AN ANSWER AT ALL !

rautamiekka gravatar imagerautamiekka ( 2015-06-01 16:08:28 +0200 )edit
0

answered 2014-01-27 01:14:12 +0200

m.a.riosv gravatar image
edit flag offensive delete link more

Question Tools

1 follower

Stats

Asked: 2014-01-27 01:09:21 +0200

Seen: 2,820 times

Last updated: Jan 27 '14