Personal tools

Log in: Login Name

Password

Cookies are not enabled. You must enable cookies before you can log in.; Forgot your password?; New user?

Recent RPM Changes: inspec-alicloud-0.10.24-1.lbn36.noarch May 16, 2024; inspec-vault-0.4.10-1.lbn36.noarch May 16, 2024; inspec-gcp-1.11.109-1.lbn36.noarch May 16, 2024; inspec-digitalocean-0.2.0-1.lbn36.noarch May 16, 2024; inspec-azure-1.118.43-1.lbn36.noarch May 16, 2024; inspec-aws-1.83.62-1.lbn36.noarch May 16, 2024; jenkins-plugin-hashicorp-vault-plugin-368.v48134f694db_f-1.lbn36.noarch May 15, 2024; jenkins-plugin-gson-api-2.10.1_15.v0d99f670e0a_7-1.lbn36.noarch May 15, 2024; jenkins-plugin-gradle-2.11-1.lbn36.noarch May 15, 2024; jenkins-slave-3206.vb_15dcf73f6a_9-1.lbn36.noarch May 15, 2024; Subscribe RSS…

You are here: Home / LBN / Up2date / Plone and Zope / BastionLinux 19 / transmogrify.webcrawler-1.2.1-7.lbn19.noarch

transmogrify.webcrawler-1.2.1-7.lbn19.noarch

Package Attributes

RPM transmogrify.webcrawler-1.2.1-7.lbn19.noarch.rpm

Architecture noarch

Size 1215826

Created 2019/09/30 06:55:35 UTC

Package Specification

Summary	Crawling and feeding html content into a transmogrifier pipeline
Group	Application/Internet
License	ZPL
Home Page	http://pypi.python.org/packages/source/t/transmogrify.webcrawler/transmogrify.webcrawler-1.2.1.zip
Description	A source blueprint for crawling content from a site or local html files. Webcrawler imports HTML either from a live website, for a folder on disk, or a folder on disk with html which used to come from a live website and may still have absolute links refering to that website. To crawl a live website supply the crawler with a base http url to start crawling with. This url must be the url which all the other urls you want from the site start with.
Requires	python(abi) collective.transmogrifier rpmlib(PayloadFilesHavePrefix) rpmlib(FileDigests) rpmlib(CompressedFileNames) python-beautifulsoup4 rpmlib(PartialHardlinkSets) rpmlib(PayloadIsXz) python-lxml
Provides	python2.7dist(transmogrify.webcrawler) python2dist(transmogrify.webcrawler) transmogrify.webcrawler
Obsoletes	transmogrify.webcrawler-egginfo