You could alternatively use the library “ PyMuPDF”. Here is a short snippet implementation of it.
import sys, fitz
def extractText(file):
doc = fitz.open(file)
text = []
for page in doc:
t = page.getText().encode(“utf8”)
text.append(t)
return text
You could alternatively use the library “ PyMuPDF”. Here is a short snippet implementation of it.
import sys, fitz
def extractText(file):
doc = fitz.open(file)
text = []
for page in doc:
t = page.getText().encode(“utf8”)
text.append(t)
return text
Distraction-free reading. No ads.
Organize your knowledge with lists and highlights.
Tell your story. Find your audience.
Read member-only stories
Support writers you read most
Earn money for your writing
Listen to audio narrations
Read offline with the Medium app
Co-Founder of Aryma Labs. Data scientist/Statistician with business acumen. Hoping to amass knowledge and share it throughout my life. Rafa Nadal Fan.