Use Cases
- Extract PDF URLs from a website's sitemap for content management.
- Automate the collection of PDF documents for SEO audits.
- Filter sitemap URLs to gather specific file types for reporting.
How It Works
1. Manually trigger the workflow to start the process.
2. Set the sitemap URL to fetch the XML data from.
3. Make an HTTP request to retrieve the sitemap content.
4. Convert the XML data into JSON format for easier manipulation.
5. Split the URLs from the JSON data into individual items for further processing.
6. Filter the URLs to retain only those ending with '.pdf'.
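For readers who want to see the same logic outside n8n, the steps above can be sketched in plain Python. This is a minimal illustration, not the workflow's actual node configuration: the function names (`fetch_sitemap`, `extract_urls`, `filter_pdf_urls`) and the sample sitemap are hypothetical, and it assumes the sitemap follows the standard sitemaps.org `<urlset>` layout rather than a sitemap index.

```python
import urllib.request
import xml.etree.ElementTree as ET

# XML namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def fetch_sitemap(sitemap_url):
    """Download the raw sitemap XML (the 'HTTP request' step)."""
    with urllib.request.urlopen(sitemap_url, timeout=30) as response:
        return response.read()


def extract_urls(sitemap_xml):
    """Parse the XML and return every <loc> value (the 'convert' and 'split' steps)."""
    root = ET.fromstring(sitemap_xml)
    locs = root.findall("sm:url/sm:loc", SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


def filter_pdf_urls(urls):
    """Keep only URLs ending with '.pdf' (the 'filter' step, case-sensitive)."""
    return [url for url in urls if url.endswith(".pdf")]


# Hypothetical sample data so the sketch runs without network access.
SAMPLE_SITEMAP = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/reports/annual.pdf</loc></url>
  <url><loc>https://example.com/about</loc></url>
  <url><loc>https://example.com/docs/guide.pdf</loc></url>
</urlset>
""".strip()

pdf_urls = filter_pdf_urls(extract_urls(SAMPLE_SITEMAP))
for url in pdf_urls:
    print(url)
```

To run it against a live site, pass the result of `fetch_sitemap("https://example.com/sitemap.xml")` (a placeholder URL) to `extract_urls`. Note that a plain suffix check like this one misses links such as `file.PDF` or `file.pdf?download=1`.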
Setup Steps
1. Import the workflow template into your n8n environment.
2. Edit the 'Set sitemap URL' node to input the desired sitemap URL.
3. Run the workflow to extract and filter the PDF URLs.
Apps Used
HTTP Client
XML Parser
Tags
#process automation
#document automation
#workflow management