At work we have a document scanner that outputs files to PDF and e-mails it to us, but the PDFs are really just full-page images mashed together as a PDF because the scanner doesn’t have OCR capability.
Here’s how to extract the text using Microsoft Office 2003 or 2007. It’s imperfect, but here’s what you can do with the tools you already have.
Read more
David Farquhar is a computer security professional, entrepreneur, and author. He started his career as a part-time computer technician in 1994, worked his way up to system administrator by 1997, and has specialized in vulnerability management since 2013. He invests in real estate on the side and his hobbies include O gauge trains, baseball cards, and retro computers and video games. A University of Missouri graduate, he holds CISSP and Security+ certifications. He lives in St. Louis with his family.