python-textract-1.4.0-1.lbn19.noarch

Package Attributes

RPM python-textract-1.4.0-1.lbn19.noarch.rpm

Architecture noarch

Size 80096 bytes

Created 2019/09/30 06:54:08 UTC

Package Specification

Summary	extract text from any document. no muss. no fuss.
Group	Application/Internet
License	ZPL
Home Page	https://pypi.python.org/packages/source/t/textract/textract-1.4.0.tar.gz
Description	As undesirable as it might be, more often than not there is extremely useful information embedded in Word documents, PowerPoint presentations, PDFs, etc. (so-called "dark data") that would be valuable for further textual analysis and visualization. While several packages exist for extracting content from each of these formats on their own, this package provides a single interface for extracting content from any type of file, without any irrelevant markup.

textract supports a growing list of file types for text extraction. If you don't see your favorite file type here, please recommend it by either mentioning it on the issue tracker or contributing a pull request.

Currently supported file types:
.csv via python builtins
.doc via antiword
.docx via python-docx
.eml via python builtins
.epub via ebooklib
.gif via tesseract-ocr
.jpg and .jpeg via tesseract-ocr
.json via python builtins
.html and .htm via beautifulsoup4
.mp3 via SpeechRecognition and sox
.msg via msg-extractor
.odt via python builtins
.ogg via SpeechRecognition and sox
.pdf via pdftotext (default) or pdfminer
.png via tesseract-ocr
.pptx via python-pptx
.ps via ps2text
.rtf via unrtf
.tiff via tesseract-ocr
.txt via python builtins
.wav via SpeechRecognition
.xlsx via xlrd
.xls via xlrd
Requires	python-chardet python-speechrecognition python(abi) rpmlib(PayloadFilesHavePrefix) rpmlib(FileDigests) python-pptx python-ebooklib python-argcomplete rpmlib(CompressedFileNames) python-extractmsg python-beautifulsoup4 python-docx /usr/bin/python2.7 rpmlib(PartialHardlinkSets) rpmlib(PayloadIsXz) python-pdfminer python-xlrd
Provides	python-textract
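
The "single interface" idea in the description amounts to choosing a backend from the file extension. The sketch below only illustrates that lookup using the extension list from the description; it is not textract's own code, and the names BACKENDS and backend_for are hypothetical. Upstream documentation describes textract.process(path) as the actual entry point, which this sketch does not call.

```python
import os

# Extension-to-backend table copied from the package description.
# BACKENDS and backend_for are hypothetical names used for illustration only.
BACKENDS = {
    ".csv": "python builtins",
    ".doc": "antiword",
    ".docx": "python-docx",
    ".eml": "python builtins",
    ".epub": "ebooklib",
    ".gif": "tesseract-ocr",
    ".jpg": "tesseract-ocr",
    ".jpeg": "tesseract-ocr",
    ".json": "python builtins",
    ".html": "beautifulsoup4",
    ".htm": "beautifulsoup4",
    ".mp3": "SpeechRecognition and sox",
    ".msg": "msg-extractor",
    ".odt": "python builtins",
    ".ogg": "SpeechRecognition and sox",
    ".pdf": "pdftotext (default) or pdfminer",
    ".png": "tesseract-ocr",
    ".pptx": "python-pptx",
    ".ps": "ps2text",
    ".rtf": "unrtf",
    ".tiff": "tesseract-ocr",
    ".txt": "python builtins",
    ".wav": "SpeechRecognition",
    ".xlsx": "xlrd",
    ".xls": "xlrd",
}


def backend_for(filename):
    """Return the backend listed for the file's extension, or None if unlisted."""
    # Compare extensions case-insensitively so "REPORT.PDF" matches ".pdf".
    extension = os.path.splitext(filename)[1].lower()
    return BACKENDS.get(extension)


if __name__ == "__main__":
    for name in ("slides.pptx", "scan.PNG", "archive.zip"):
        print(name, "->", backend_for(name))
```

A lookup that returns None for an unlisted extension leaves it to the caller to decide whether to skip the file or report it, which suits batch runs over mixed directories.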

