You are here: Home / LBN / Up2date / Business / BastionLinux 36 / python3-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch

python3-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch

Package Attributes
RPM  python3-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch.rpm Architecture  noarch Size  603005 Created  2022/09/06 06:03:08 UTC
Package Specification
Summary This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Group Unspecified
License GPL
Home Page http://github.com/djay/transmogrify.htmlcontentextractor
Description

Introduction Helpful transmogrifier blueprints to extract text or html out of html content. transmogrify.htmlcontentextractor.auto This blueprint has a clustering algorithm that tries to automatically extract the content from the HTML template. This is slow and not always effective. Often you will need to input your own template extraction rules. In addition to extracting Title, Description...

Requires
rpmlib(PayloadFilesHavePrefix)  
rpmlib(FileDigests)  
rpmlib(PartialHardlinkSets)  
rpmlib(CompressedFileNames)  
rpmlib(PayloadIsZstd)  
Provides
python-transmogrify-htmlcontentextractor
python3-transmogrify-htmlcontentextractor
python3.10-transmogrify-htmlcontentextractor
python3.10dist(transmogrify-htmlcontentextractor)
python3.10dist(transmogrify.htmlcontentextractor)
python3dist(transmogrify-htmlcontentextractor)
python3dist(transmogrify.htmlcontentextractor)
Obsoletes
python-transmogrify-htmlcontentextractor

Document Actions