Venkat Raman
1 min read · Jan 16, 2019

Hi Shaurya,

By and large, resumes are in PDF format, so the code was written to handle that format. To include doc or docx files, you can easily use libraries like textract or docx2txt. For HTML, I guess you meant web scraping; in that case you can use a library like Beautiful Soup. I have not come across resumes in ppt format, but there is a library called python-pptx that can help you extract text from PowerPoint files.
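As a rough sketch of how those libraries could be wired in, something like the following might work. The function names (`extract_docx`, `extract_html`, `extract_pptx`, `extract_text`) and the `EXTRACTORS` table are made up for illustration and are not part of the original resume code. It assumes docx2txt, beautifulsoup4 and python-pptx are installed, and it covers only .docx, .html/.htm and .pptx; older .doc files would need something like textract instead.

```python
from pathlib import Path


def extract_docx(path):
    # docx2txt handles .docx files (pip install docx2txt)
    import docx2txt
    return docx2txt.process(str(path))


def extract_html(path):
    # Beautiful Soup strips the markup and keeps the visible text
    # (pip install beautifulsoup4)
    from bs4 import BeautifulSoup
    html = Path(path).read_text(encoding="utf-8", errors="ignore")
    return BeautifulSoup(html, "html.parser").get_text(separator="\n")


def extract_pptx(path):
    # python-pptx reads .pptx files (pip install python-pptx)
    from pptx import Presentation
    lines = []
    for slide in Presentation(str(path)).slides:
        for shape in slide.shapes:
            # Only shapes with a text frame carry text (skips pictures etc.)
            if shape.has_text_frame:
                lines.append(shape.text_frame.text)
    return "\n".join(lines)


# Hypothetical lookup table: file extension -> extractor function
EXTRACTORS = {
    ".docx": extract_docx,
    ".html": extract_html,
    ".htm": extract_html,
    ".pptx": extract_pptx,
}


def extract_text(path):
    """Pick an extractor based on the file extension and return plain text."""
    suffix = Path(path).suffix.lower()
    try:
        extractor = EXTRACTORS[suffix]
    except KeyError:
        raise ValueError(f"Unsupported resume format: {suffix or '(none)'}")
    return extractor(path)
```

Calling `extract_text("resume.docx")` would then return the text as one string, which can be fed into the same keyword matching step used for the PDF text. The third-party imports sit inside each function so that only the library for the format you actually use needs to be installed.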

Written by Venkat Raman

Co-Founder of Aryma Labs. Data scientist/Statistician with business acumen. Hoping to amass knowledge and share it throughout my life. Rafa Nadal Fan.