You are here: Home / LBN / Up2date / Plone and Zope / BastionLinux 36 / python2-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch

python2-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch

Package Attributes
RPM  python2-transmogrify-htmlcontentextractor-1.0-1.lbn36.noarch.rpm Architecture  noarch Size  696442 Created  2023/03/22 03:19:22 UTC
Package Specification
Summary This blueprint extracts out title, description and body from html either via xpath or by automatic cluster analysis
Group Unspecified
License GPL
Home Page http://github.com/djay/transmogrify.htmlcontentextractor
Description

Introduction Helpful transmogrifier blueprints to extract text or html out of html content. transmogrify.htmlcontentextractor.auto This blueprint has a clustering algorithm that tries to automatically extract the content from the HTML template. This is slow and not always effective. Often you will need to input your own template extraction rules. In addition to extracting Title, Description...

Requires
rpmlib(PayloadFilesHavePrefix)  
rpmlib(PayloadIsZstd)  
rpmlib(CompressedFileNames)  
rpmlib(PartialHardlinkSets)  
rpmlib(FileDigests)  
Provides
python2-transmogrify-htmlcontentextractor

Document Actions