download links not present on page #38

cnjr2 · 2015-06-21T13:13:19Z

It would be great to be able to download content that a page does not link to directly.

For example, a given webpage (e.g. www.paper.com) embeds some low-resolution images:

<img src="/foo/carousel/bar/image1.jpg" class="figure"></img>
<img src="/foo/carousel/bar/image2.jpg" class="figure"></img>
<img src="/foo/carousel/bar/image3.jpg" class="figure"></img>

I want to get the high-resolution versions, and I know their locations:

www.paper.com/foo/images/bar/image1.jpg
www.paper.com/foo/images/bar/image2.jpg
www.paper.com/foo/images/bar/image3.jpg

I would like to be able to replace "carousel" with "images" in each path (with the XPath replace() function, for example) and then follow the resulting link to download the image:

"figure": {
  "selector": "replace(//img[@class='figure'], 'carousel', 'images')",
  "download": true
}
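Until something like this is supported in the scraper definition, the same rewrite can be done outside the scraper. The following is only a minimal sketch in Python, not part of this project: `FigureSrcCollector` and `high_res_urls` are hypothetical names, and the `http://` scheme on the base URL is an assumption, since the example above gives only the host name. It collects the `src` of every `<img class="figure">`, swaps the `carousel` path segment for `images`, and resolves the result against the page URL.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FigureSrcCollector(HTMLParser):
    """Collect the src attribute of every <img class="figure"> element."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        attr_map = dict(attrs)
        # Only exact class="figure" matches are kept (hypothetical rule for this sketch).
        if tag == "img" and attr_map.get("class") == "figure" and attr_map.get("src"):
            self.sources.append(attr_map["src"])


def high_res_urls(page_url, html, old="carousel", new="images"):
    """Return absolute URLs with the `old` path segment replaced by `new`."""
    collector = FigureSrcCollector()
    collector.feed(html)
    return [
        urljoin(page_url, src.replace(f"/{old}/", f"/{new}/"))
        for src in collector.sources
    ]


# The markup from the example above; the scheme on the base URL is assumed.
PAGE_HTML = """
<img src="/foo/carousel/bar/image1.jpg" class="figure"></img>
<img src="/foo/carousel/bar/image2.jpg" class="figure"></img>
<img src="/foo/carousel/bar/image3.jpg" class="figure"></img>
"""

urls = high_res_urls("http://www.paper.com/", PAGE_HTML)
for url in urls:
    print(url)
```

Each printed URL could then be passed to any downloader. Replacing the whole `/carousel/` segment, rather than the bare word, avoids touching file names that happen to contain "carousel".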

tarrow · 2016-09-23T10:39:47Z

I think this is a sub-issue of #16.
