Ask Your Question
1

How to convert a PDF with tables to Calc spreadsheet?

asked 2019-07-14 12:26:04 +0200

vstepaniuk gravatar image

updated 2019-07-14 13:05:06 +0200

How to convert a PDF with tables to Calc spreadsheet in LibreOffice on Linux? (preferably in command line)

edit retag flag offensive close merge delete

Comments

A good question but with a sad answer, read on.

lazarus477 gravatar imagelazarus477 ( 2019-07-24 01:24:14 +0200 )edit

3 Answers

Sort by » oldest newest most voted
1

answered 2019-07-14 12:42:35 +0200

Opaque gravatar image

updated 2019-07-14 12:44:33 +0200

Hello,

not at all using LibreOffice. PDF is an exchange / output format (vectorized page description language) and as such not designed to be imported into editors or other programs for editing. Though LibreOffice Draw can import PDFs for editing this is to be considered a "workaround solution" by design which does not provide what you are searching for. There are tools on the internet claiming to perform a conversion to some editable format (i.e word, etc) but LibreOffice is no such tool.

edit flag offensive delete link more

Comments

Think of a PDF as a vector based image or photo of a document. Reading it as if it were a CSV file would be similar to using OCR software to convert a scanned document back into an editable word processing document.

lazarus477 gravatar imagelazarus477 ( 2019-07-24 01:23:27 +0200 )edit

yepp - and my answer just tells, that LibreOffice in the sense of your comment is not an OCR tool. So what's sad with the answer. Just the bad news?

Opaque gravatar imageOpaque ( 2021-02-26 18:05:02 +0200 )edit

For simple tables or short tables where you can use the space as a delimiter, that is, without spaces in the text of the table, you can do a manual conversion but not using Draw. Paying for a program or for conversion would be easier if you are doing more than just occasionally.

Open the pdf in a pdf reader such as Adobe Reader, copy the text (I mean text, numbers, dates as they are all just text) in the tables, paste into Calc. At this point you can add quote delimiters if needed to allow for spaces within some text, it will get tedious for more than just a few. Select all the text, click Data > Text to Columns and select Space as separator.

Earnest Al gravatar imageEarnest Al ( 2021-02-27 00:20:35 +0200 )edit
0

answered 2021-02-25 21:11:47 +0200

Todd2 gravatar image

I use https://www.ilovepdf.com/pdf_to_excel and then run it through https://virustotal.com to make sure there is no mischief involved (none so far)

edit flag offensive delete link more
0

answered 2021-02-26 17:53:22 +0200

simlpy me gravatar image

updated 2021-02-26 17:57:53 +0200

Able2Extract has a 7 day free trial. $39 for 1 month or $160 to purchase.
I am using it to extract tables from pdf to calc. The manual leaves a bit to be desired but their online help is fairly fast.

edit flag offensive delete link more
Login/Signup to Answer

Question Tools

1 follower

Stats

Asked: 2019-07-14 12:26:04 +0200

Seen: 2,632 times

Last updated: Feb 26