
How to extract data and download files from a URL more efficiently

💡 Do you need to download files from multiple URLs where each URL is a unique download link? We've got you covered.

Problem Description


You need to extract a list of URLs and then perform certain actions on each of them. For example, you need to navigate to the following web page:

https://www.i-pex.com/library/white-paper


This page lists links to white papers. You need to open each link to download its PDF file, which is tedious to automate with UI elements.



Solution


The following flow provides a high-level overview of the solution:


Firstly, we will need to extract the list of white papers as a datatable using the action “Extract data from web page”. Remember to specify the store data mode as “Variable”.



While the “Extract data from web page” action is open, navigate to ‘https://www.i-pex.com/library/white-paper’ in the browser manually. Right click the first title > Select Extract Element Value > Select Href


Right click the second title > Select Extract Element Value > Select Href


The Live web helper pop-up then automatically extracts the remaining items in the list. You may update the column name if required > Click Finish
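For readers who want to see what this extraction amounts to outside the flow designer, the idea is to collect the Href of every repeated title link. The following is a minimal Python sketch under stated assumptions, not part of the flow itself: the sample HTML, the `LinkCollector` class and the `extract_links` helper are hypothetical, and the real page markup may differ.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag, resolved against a base URL."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href:
            # Relative links are turned into absolute URLs.
            self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Return all link targets found in the given HTML, in page order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


# Hypothetical markup standing in for the list of white paper titles.
SAMPLE_HTML = """
<ul>
  <li><a href="/library/white-paper/example-1">Example paper 1</a></li>
  <li><a href="/library/white-paper/example-2">Example paper 2</a></li>
</ul>
"""

links = extract_links(SAMPLE_HTML, "https://www.i-pex.com/library/white-paper")
for link in links:
    print(link)
```

The resulting list plays the same role as the single-column datatable produced by the action: one URL per row.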


Use an "If" action to check if the download directory exists. If it does not, create the directory.
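The same check-then-create logic can be sketched in Python for comparison. This is only an illustration: the `ensure_directory` helper and the `white-papers` folder name are assumptions, and a temporary directory is used so the example does not touch real folders.

```python
import tempfile
from pathlib import Path


def ensure_directory(path):
    """Create the directory only if it does not exist yet, then return it."""
    directory = Path(path)
    if not directory.exists():
        directory.mkdir(parents=True)
    return directory


# A temporary base folder stands in for the real download location.
base = Path(tempfile.mkdtemp())
download_dir = ensure_directory(base / "white-papers")
print(download_dir.is_dir())
```

In Python the explicit check can also be replaced by `directory.mkdir(parents=True, exist_ok=True)`, which does nothing when the folder already exists.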


Then, using the action “For each”, we will loop through each datarow “CurrentItem” in the datatable. For each iteration of the loop:

  1. We use the action “Go to web page” to navigate to the URL “%CurrentItem[‘URL’]%”

  2. We use the action “Extract data from web page” to extract the file URL. Remember to specify the store data mode as “Variable”.



While the “Extract data from web page” action is open, navigate to one of the items manually. Right click the title > Select Extract element value > Select Href > Click Finish


  3. We use the action “Download from web” to download the file:

> Specify the “URL” as “%Link%”

> Configure the “Save response” and “File name”

> Specify the “Destination folder”
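The three steps inside the loop (open the item page, extract the file link, download the file) can be sketched in Python as follows. This is a rough illustration under several assumptions, not a description of how the flow works internally: the `FirstPdfLink` class, the `fetch_bytes` and `download_all` helpers and the `example.com` URLs are hypothetical, and the sketch assumes the file link is an `<a>` element whose Href ends in `.pdf`. A fake in-memory site is used in place of real network requests.

```python
import os
import tempfile
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class FirstPdfLink(HTMLParser):
    """Remembers the first <a> href that ends in .pdf (an assumption)."""

    def __init__(self):
        super().__init__()
        self.href = None

    def handle_starttag(self, tag, attrs):
        href = dict(attrs).get("href")
        if tag == "a" and self.href is None and href and href.lower().endswith(".pdf"):
            self.href = href


def fetch_bytes(url):
    """Default fetcher: download the raw bytes behind a URL."""
    with urlopen(url) as response:
        return response.read()


def download_all(page_urls, destination, fetch=fetch_bytes):
    """For each page URL, find the file link and save the file locally."""
    destination = Path(destination)
    destination.mkdir(parents=True, exist_ok=True)
    saved = []
    for page_url in page_urls:
        # Step 1: open the item page.
        html = fetch(page_url).decode("utf-8", errors="replace")
        # Step 2: extract the file link from the page.
        parser = FirstPdfLink()
        parser.feed(html)
        if parser.href is None:
            continue  # no file link found on this page, skip it
        file_url = urljoin(page_url, parser.href)
        # Step 3: download the file, naming it after the last URL segment.
        file_name = os.path.basename(urlparse(file_url).path)
        target = destination / file_name
        target.write_bytes(fetch(file_url))
        saved.append(target)
    return saved


# Fake site used instead of real requests, so the sketch runs offline.
FAKE_SITE = {
    "https://example.com/papers/one": b'<a href="/files/one.pdf">Download</a>',
    "https://example.com/files/one.pdf": b"%PDF-1.4 demo",
}

saved = download_all(
    ["https://example.com/papers/one"],
    tempfile.mkdtemp(),
    fetch=FAKE_SITE.__getitem__,
)
print([path.name for path in saved])
```

Passing the fetcher as a parameter keeps the loop logic separate from the network access, which makes it easy to try the loop on sample data before pointing it at real URLs.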


Close the web browser after the loop ends.







Additional Information


  • Last updated on: 9 Dec 2024

  • Tested version(s): 2.50.00183.24303

  • Prerequisites: Browser (e.g. Chrome)

  • Dependencies: None

  • Known issues: None



© 2017-2025 CFB Bots Pte Ltd. All Rights Reserved. Tel: (65) 6909 2099 | Email: enquiries@cfb-bots.com | Privacy Policy | Terms of Use
