Looking for a Widget and/or PlugIn to search keywords or phrases in all searchable PDFs that are stored in a certain folder or web location

Posted on July 25, 2020

Hello. I am looking for a widget and/or plugin to search for keywords or phrases in all searchable PDFs that are stored in a certain folder or web location on DigitalOcean. Thanks.




Hi there @AtlInq,

It depends on how exactly you need to use the tool. For example, if you just want to run the search from a script, you could use Python with PyPDF2.

The script itself would look something like this:

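# import packages
import re

from PyPDF2 import PdfReader  # PyPDF2 2.x or later; older releases used PdfFileReader

# open the PDF file
reader = PdfReader("test.pdf")

# define the key term to search for
keyword = "Social"

# extract the text of each page and search it
for page_number, page in enumerate(reader.pages, start=1):
    # extract_text() can come back empty for scanned (image-only) pages
    text = page.extract_text() or ""
    # re.escape treats the key term as literal text, not as a regex pattern
    match = re.search(re.escape(keyword), text)
    if match:
        print(f"page {page_number}: found '{keyword}' at offset {match.start()}")
    else:
        print(f"page {page_number}: no match")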

Source.
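Since the question is about every PDF in a folder rather than a single file, the same idea could be extended roughly as follows. This is only a sketch under a few assumptions: the names `search_folder` and `pypdf2_page_texts` are made up for illustration, PyPDF2 2.x or later is assumed to be installed, and the PDFs are assumed to sit on the local filesystem of the Droplet (files at a web location would have to be downloaded first).

```python
import re
from pathlib import Path


def pypdf2_page_texts(pdf_path):
    """Return the text of each page of one PDF (hypothetical helper)."""
    # Imported lazily; assumes PyPDF2 2.x or later is installed.
    from PyPDF2 import PdfReader

    reader = PdfReader(str(pdf_path))
    # extract_text() can come back empty for scanned (image-only) pages.
    return [page.extract_text() or "" for page in reader.pages]


def search_folder(folder, phrase, page_texts=pypdf2_page_texts):
    """Return (file name, page number, match count) for each page containing phrase."""
    # Literal, case-insensitive match; drop re.escape to allow regex patterns.
    pattern = re.compile(re.escape(phrase), re.IGNORECASE)
    hits = []
    # rglob also descends into subfolders of the given folder.
    for pdf_path in sorted(Path(folder).rglob("*.pdf")):
        for page_number, text in enumerate(page_texts(pdf_path), start=1):
            count = len(pattern.findall(text))
            if count:
                hits.append((pdf_path.name, page_number, count))
    return hits
```

Calling `search_folder("pdfs", "Social")` would then return a list of tuples that you can print or feed into a small web page. Keeping the text extraction in a separate function makes it easier to swap in another PDF library later. Note that text extraction quality varies between PDFs, so a phrase that is split across lines in the file may not be found.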

Regards, Bobby
