Personal tools

Log in: Login Name

Password

Cookies are not enabled. You must enable cookies before you can log in.; Forgot your password?; New user?

Recent RPM Changes: bastion-cloud-init-36.0.1_1.6.24-1.lbn36.noarch Sep 11, 2025; python3-lbn-cloud-init-1.6.24-1.lbn36.noarch Sep 11, 2025; python3-lbn-cloud-init+cloudinit-36.0.1_1.6.24-1.lbn36.noarch Sep 11, 2025; python3-google-api-core+grpcio-gcp-2.23.0-3.lbn36.noarch Sep 11, 2025; python3-google-api-core+grpcgcp-2.23.0-3.lbn36.noarch Sep 11, 2025; python3-google-api-core+grpc-2.23.0-3.lbn36.noarch Sep 11, 2025; python3-google-api-core-2.23.0-3.lbn36.noarch Sep 11, 2025; python3-docstring-parser-0.17.0-2.lbn36.noarch Sep 11, 2025; python3-propcache-0.2.0-4.lbn36.x86_64 Sep 11, 2025; python3-rich-14.1.0-2.lbn36.noarch Sep 11, 2025; Subscribe RSS…

You are here: Home / LBN / Up2date / Plone and Zope / BastionLinux 13 / transmogrify.webcrawler-1.2.1-2.lbn13.noarch

transmogrify.webcrawler-1.2.1-2.lbn13.noarch

Package Attributes

RPM transmogrify.webcrawler-1.2.1-2.lbn13.noarch.rpm

Architecture noarch

Size 1219768

Created 2017/08/04 11:07:55 UTC

Package Specification

Summary	Crawling and feeding html content into a transmogrifier pipeline
Group	Application/Internet
License	ZPL
Home Page	http://pypi.python.org/packages/source/t/transmogrify.webcrawler/transmogrify.webcrawler-1.2.1.zip
Description	A source blueprint for crawling content from a site or local html files. Webcrawler imports HTML either from a live website, for a folder on disk, or a folder on disk with html which used to come from a live website and may still have absolute links refering to that website. To crawl a live website supply the crawler with a base http url to start crawling with. This url must be the url which all the other urls you want from the site start with.
Requires	python-BeautifulSoup python(abi) collective.transmogrifier rpmlib(PayloadFilesHavePrefix) rpmlib(FileDigests) /bin/sh python-ordereddict rpmlib(CompressedFileNames) rpmlib(PartialHardlinkSets) rpmlib(PayloadIsXz) python-lxml
Provides	transmogrify.webcrawler
Obsoletes	transmogrify.webcrawler-egginfo