Overview
Artifact ID: | c78295372cb0597f10a64f23da25eb9deeb04fb9828b732d24595be2cd250691 |
---|---|
Page Name: | scrape-script |
Date: | 2018-07-12 13:55:31 |
Original User: | mario |
Mimetype: | text/x-markdown |
Next | 3cf84fa5c6b322d8d577643aa44870043227b5c64ef3e457dd0f13e9f0b022ef |
Content
scrape script
- First, this generates a URL list from the IA search API, so that only the interesting content pages are retrieved
- The extraction script then converts src/* to target/* and populates an open Fossil repo right away
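
The two steps above could be sketched roughly as follows. This is not the original script, only a minimal illustration under stated assumptions: the search endpoint, its query parameters, the one-URL-per-line response format, the `/wiki/` filter pattern and the `.md` output suffix are all hypothetical, and the function names are invented for this example. The only Fossil commands used are `fossil add` and `fossil commit -m`, run inside an already open checkout.

```python
import re
import subprocess
from pathlib import Path
from urllib.parse import urlencode


def build_search_url(endpoint, **params):
    """Return the search request URL (endpoint and parameters are assumptions)."""
    return endpoint + "?" + urlencode(sorted(params.items()))


def select_content_urls(lines, pattern):
    """Keep unique URLs matching `pattern`, preserving first-seen order."""
    wanted = re.compile(pattern)
    seen = set()
    result = []
    for line in lines:
        url = line.strip()
        if url and url not in seen and wanted.search(url):
            seen.add(url)
            result.append(url)
    return result


def convert_tree(src, target, convert, suffix=".md"):
    """Run `convert` on every file under src/ and write the result under target/."""
    src, target = Path(src), Path(target)
    written = []
    for path in sorted(p for p in src.rglob("*") if p.is_file()):
        out = (target / path.relative_to(src)).with_suffix(suffix)
        out.parent.mkdir(parents=True, exist_ok=True)
        text = path.read_text(encoding="utf-8")
        out.write_text(convert(text), encoding="utf-8")
        written.append(out)
    return written


def add_to_fossil(files, message, checkout_dir):
    """Add files to the Fossil checkout open in `checkout_dir` and commit them."""
    paths = [str(Path(f).resolve()) for f in files]
    subprocess.run(["fossil", "add", *paths], cwd=checkout_dir, check=True)
    subprocess.run(["fossil", "commit", "-m", message], cwd=checkout_dir, check=True)
```

In such a sketch, the response of the search request would be passed line by line to `select_content_urls`, the selected pages would be downloaded into src/, and `convert_tree` followed by `add_to_fossil` would fill the repo. Keeping the Fossil step in its own function means the conversion can be checked without a repository at hand.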