• 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to create CSV files for PDF fakebooks
#4
You can also use tesseract to read an image of the table of contents and create a text file that you can then edit into a csv file.

This method works really well or really poorly depending on the quality and resolution of the source image file.  Sometimes it's as easy as typing the tesseract command and then adding the semicolons.

Sometimes it's easier to just retype the whole thing from scratch.

Usually it's something in the middle where you need to do a bit of fix-up on a few titles or misread page number.  "My 01d Kentucky Home" for example.

But it's usually more efficient than typing the whole thing in again from scratch.

After creating the csv file you need to check to be sure that the page numbers from the table of contents matches the pages in the pdf file.  If they don't you can easily change them (add 2 to each page number, for example) using a spreadsheet.
If you're a zombie and you know it, bite your friend!
We got both kinds of music: Country AND Western
Reply


Messages In This Thread
RE: How to create CSV files for PDF fakebooks - by Frank Cox - 12-20-2023, 09:13 AM



Users browsing this thread:
1 Guest(s)


  Theme © 2014 iAndrew  
Powered By MyBB, © 2002-2024 MyBB Group.